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SYKTHBTIC DN& SEQUEHCS HSVZHG SHHftHCSD 
IHSECTIC7DAL ACTTVTTY IN M&IZB 

This application is a concinuarion in parr applicarion 
of U.S. serial no. 772,027 filed October 4, 1991, which 
disclosure is herein incorporated in its entirety. 

Field of the Invention 

The present invention relates to DNA sequences encoding 
insecticidal proteins, and expression of these sequences in 
plants. 

3ac*cground of the Invention 
Zxpressioi* of the insecticidal protein (IP) genes 
derived from Bacillus thurinqiensis (Bt) in plants has proven 
extremely difficult. Attempts have been made to express 
chimeric promoter/Bt IP gene combinations in plants. 
Typically, only low levels of protein have been obtained in 
transgenic plants. See , for example, Vaecic et al.. Nature 
228:33-37, 1987; Barton et al., Plant Phvsiol. 85:1103-1109, 
1987; Fischoff et al., Bio/Technoloov 5:807-813, 1987. 

One postulated explanation for the cause of low 
expression is that fortuitious transcription processing sites 
produce aberrant forms of 3t IP mRNA transcript. These 
aberrantly processed transcripts are non- functional in a plant, 
in terms of producing an insecticidal protein. Possible 
processing sites include polyadenylation sites, intron splicing 
sites, transcriptional termination signals and transport 
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signals. Mosr genes do nor contain sites tiiat wiii 
deieteriousiy affect gene expression in that gene's normal host 
organism. However, the fortuitous occurrence of such 
processing sites in a coding region might complicate the 
expression of that gene in transgenic hosts. For example , the 
coding region for the 3t insecticidal crystal protein gene 
derived from Bacillus thurinaiensis strain tairstaki (GENBANK 
3THKDRHD, accession M1S271^ thurinaiensis var. jcurstaJci , 
HD-1; Geiser et al. Gene 48:109-118 (1986)) as derived directly 
from Bacillus thurinaiensis , might contain sites which prevent 
this gene from being properly processed in plants. 

Fxirther difficulties exist when attempting to express 
Bacillus thurinaiensis protein in an organism such as a plant. 
It has been discovered that the codon usage of a native at IP 
gene is significantly different from that which is typical of a 
plant gene. In particular, the codon usage of a native 3t IP 
gene is very different from that cf a maise gene. As a result, 
the mBKA from this gene may not be efficiently utilized. Codon 
usage might influence the expression of genes at the level of 
translation or transcription or mRMA processing. To optimize 
an insecticidal gene for expression in plants, attempts have 
been made to alter the gene to resemble, as much as possible, 
genes naturally contained within the host plant to be 
transformed. 

Adang et al., EP 03S9472 (1990), relates to a synthetic 
Bacillus thurinaiensis tenebrionis (Btt) gene which is 35% 
homologous to the native Btt gene and which is designed to have 
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an A+T content approximating that found in plants in general. 
Table 1 of Adang et al. show the codon sequence of a synthetic 
Btt gene which was made to resemble more closely the normal 
codon distribution of dicot genes. Adang et al. state that a 
synthetic gene coding for IP can be optimised for enhanced 
expression in monocot plants through similar methods, 
presenting the frequency of codon usage of highly expressed 
monocot proteins in Table 1. At page 9, Adang et al. state 
that the synthetic Btt gene is designed to have an A<t-T content 
of 55% (and, by implication, a G4^C content of 45%) . At page 
20, Adang et al. disclose that the synthetic gene is designed 
by altering individual amino acid codons in the native 3t gene 
to reflect the overall distribution of codons preferred by 
dicot genes for each amino acid within the coding region of the 
gene. Adang et al. further state that only some of the native 
Btt gene codons will be replaced by the most preferred plant 
codon for each amino acia, such that the overall oistribution 
of codons used in dicot proteins is preserved. 

Fischhoff et al., SF 0 385 962 (1990), relates to plant 
genes encoding the crystal protein toxin of Bacillus 
thurinaiensis . At table V, Fischhoff et al. disclose percent 
usages for codons for each amino acid. At page 8, Fischoff et 
al. suggest modifying a native 3t gene by removal of putative 
polyadenylation signals and ATTTA sequences. Fischoff et al. 
further suggest scanning the native Bt gene sequence for 
regions with greater than four consecutive adenine or thymine 
nucleotides to identify putative plant polyadenylation signals. 
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Fisclioff er al« scace thaz zhe nucleotiide sequence saould be 
altered if more than one purarive polyadenylatiion signal is 
identified within ten nucleotid s of each, other. At pag 9, 
Fischoff er al. state that efforts should be aade to select 
codons to preferably adjust the G^C content to about 50%. 

Perlaic et al., PNAS USA > 88:3324-3328 (1991), relates 
to modified coding sequences of the Bacillus thurinaiensis 
crylA(b) gene, similar to those shown in Fischoff et al. As 
shown in table 1 at page 3325, the partially modified crylA(b) 
gene of Perlaic et al. is approximately 96% homologous to the 
native crylA(b) gene (1681 of 1743 nucleotides) , with a GH: 
content of 41%, number of plant polyadenylation signal 
sequences (PPSS) reduced from 18 to 7 and number of AITTA 
sequences reduced from 13 to 7, The fully modified cr7lA(b) 
gene of Perlak et al. is disclosed to be fully synthetic (page 
3325, column 1) . This gene is approximately 79% homologous to 
the native crylA(b) gene (1455 of 1345 nucleotides), with a G+C 
content of 49%, nismber of plant polyadenylation signal 
sequences (PPSS) reduced to 1 and all ATT7A sequences removed. 

Barton et al., SP 0431 329 (1991), relates to the 
expression of insecticidal toxins in plants. At column 10, 
Barton et al. describe the construction of a synthetic AalT 
insect toxin gene encoding a scorpion toxin using the most 
preferred codon for each amino acid according to the chart 
shown in Figure 1 of the document. 

Summary of the Invention 
The present invention is drawn to methods for enhancing 
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expression of hecerologous genes in planr cells. Generally, a 
gene or coding region of inreresr is consrrucred to provide a 
planr specific preferred codon sequence. In t^is manner, codon 
usage for a parricular prorein is altered to increase 
expression in a parricular planr. Such planr oprimized coding 
sequences can be operably liniced to promoters capable of 
directing expression of the coding seqence in a planr cell. 

Specifically, it is one of tiie objects of the present 
invention to provide synrheric insecricidal prorein genes wiiicix 
have been oprimized for expression in planrs. 

It is another object of the present invenrion to 
provide synrheric Bt insecricidal protein genes to maximize the 
expression of Bt proteins in a plants preferably in a maize 
planr. It is one fearure of rhe presenr invention rhar a 
synrheric 3t IP gene is constructed using the most preferred 
maize codonsr except for alterations necessary to provide 
ligation sites for construction of the full synthetic 
gene. 

* 

According to the above objects r we have synthesized Bt 
insecticidal crystal prorein genes in which the codon usage has 
been altered in order to increase expression in plants r 
particularly maize. However , rather than alter the codon usage 
to resemble a maize gene in terms of overall codon 
distribution, we have optimized the codon usage by using the 
codons which are most preferred in maize (maize preferred 
codons) in the synthesis of the synthetic gene. The optimized 
maize preferred codon usage is effective for expression of high 
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levels of zhe 3t iasecrlcidal protein. This might be rhe 
result of aiaximizing the am unt of 3t insecticidal protein 
translated from a given popxilation of messenger RNAs. The 
synthesis of a Bt IP gene using maize preferred codons also 
tends to eliminate fortuitous processing sites that might occur 
in the native coding sequence. The expression of this 
synthetic gene is significantly higher in maize cells th a n that 
of the native IP Bt gene. 

Preferred synthetic , maize optimized DNA sequences of 
the present invention derive from the protein encoded by the 
crylA(b) gene in Bacillus thurinoiensis var. Jcurstakir HD-1; 
Geiser et al., Gene, 48:109*118 (1986) or the cry IB gene (AKZl 
Crya4 gene) described by 3rizzard and Wbiteley, Nuc . Acids . 
Res, , 16:2723 (1988) . The DtQl sequence of the native ImrsnakL 
HD-1 crylA(b) gene is shown as Sequence 1. These proteins are 
active against various lepidopteran insects r including Ostrinia 
ntibilaliS r the European Com Borer. 

%ile the present invention has been exemplified by the 
S3^thesis of maise optimized protein genes f it is recognized 
that the method can be utilized to optimise expression of any 
protein in plants. 

The instant optimized genes can be fused t^ith a variety 
of promoters r including constitutive r inducible, temporally 
regulated, developmentally regulated, tissue-preferred and 
tissue**specific promoters to prepare recombinant ONA molecules, 
i.e., chimeric genes. The maize optimized gene (coding 
sequence) provides substantially higher levels of expression in 
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a transformed plant, when compared witii a non-maise optimized 
gene. Accordingly, plants resistant to Coleopteran or 
Lepidopteran pests, such as European com borer and sugarcane 
borer f can be produced. 

It is another object of the present invention to 
provide tissue-preferred and tissue*specific promoters which 
drive the expression of an operatively associated structural 
gene of interest in a specific part or parts of a plant to the 
substantial exclusion of other parts. 

It is another object of the present invention to 
provide pith-preferred promoters . By "pith-preferred, " it is 
intended that the promoter is capable of directing the 
expression of an operatively associated structural gene in 
greater abxindance in the pith of a plant than in the roots, 
outer sheath, and brace roots, and with sribstantially no 
expression in seed. 

It is yet another object of this invention to provide 
pollen-specific promoters. By "pollen-specific," it is 
intended that the promoter is capable of directing the 
expression of an operatively associated structural gene of 
interest substantially exclusively in the pollen of a plant, 
with negligible expression in any other plant part. By 
"negligible," it is meant functionally insignificant. 

It is yet another object of the present invention to 
provide recombinauit DNA molecules comprising a tissue-preferred 
promoter or tissue-specific promoter operably associated or 
linked to a structural gene of interest, particularly a 
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srrucrural gene encoding an insecricidal protein/ and 
expression of recombinant molecul in a plant. 

It is a further object of the present invention to 
provide transgenic plants which express at least one structural 
gene of interest operatively in a tissue**preferred or 
tissue-*specific expression pattern. 

In one specific embodimant of the invention disclosed 
and claimed herein, the tissue-preferred or tissue-specific 
promoter is operably linked to a structural gene encoding an 
insecticidal protein, and a plant is stably transformed with at 
least one such recombinant molecule. The resultant plant will 
be resistant to particular insects which feed on those parts of 
the plant in which the gene(s) is (are) expressed. Preferred 
structural genes ancode B.t. insecticidal proteins. Mdre 
preferred are maize optimized 3.t. IP genes. 

Brief Description of the Figures 
?ig. 1 is a comparison of the full-length native Bt 
crylA(b) gene [BTHKORHD], a full-length synthetic maize 
optimized Bt crylA(b) gene [ f Isynbt . f in ] and a truncated 
synthetic maize optimized 3t crylA(b) gene [bssyn] . This 
figure shows that the full-length synthetic maize optimized 
crylA(b) gene sequence matches that of the native crylA(b) gene 
at about 2354 out of 3468 nucleotides (approximately 68% 
homology) » 

Fig* 2 is a comparison of the truncated native Bt 
cryia.{b) gene [BTHKDRHD] and a truncated synthetic maize 

8 
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opriaized 3t gene [bssyn] . This figure shows tihat the 
truncated synthetic maize optimized crylA(b) gene sequence 
matches that of the native crylA(b) gene at about 1278 out of 
1947 nucleotides (approximately 66% homology) . 

Fig, 3 is a comparison of the pure maize optimized 3t 
gene sequence [synlT.mze] with a truncated synthetic maize 
optimized 3t gene [bssyn] and a full-length synthetic maize 
optimized 3t gene modified to include restriction sites for 
facilitating construction of the gene [synful.mod] . This 
figure shows that the truncated synthetic maize optimized 
crylA(b) gene sequence matches that of the pure maize optimized 
crylA(b) gene at 1913 out of 1947 nucleotides (approximately 
98% homology) . 

Fig. 4 is a comparison of a native truncated 3t 
crylA(b) gene [BTHKDRHD] with a truncated synthetic crylA(b) 
gene described in Perlak et al., PNAS USA . 88:3324-3328 (1991) 
[PMONBTl and a truncated synthetic maize optimized Bt gene 
[bssyn] . This figure shows that the PMONBT gene sequence 
matches that of the native crylA(b) gene at about 1453 out of 
1845 nucleotides (approximately 79% homology)/ while the 
truncated synthetic maize optimized 3t crylA(b) gene matches 
the native crylA(b) gene at about 1209 out of 1845 nucleotides 
(approximately 56% homology) . 

Fig. 5 is a comparison of a truncated synthetic 
crylA(b) gene described in PerlaJc et al., PNAS CTSA r 
38:3324-3328 (1991) [PMONBT] and a truncated synthetic maize 
optimized St crylA(b) gene [bssyn] * This figure shows that the 
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PMONBT gene sequence matdies tiiat of rhe truncated synrh tic 
maxze optiolzed 3t crylA(b) gene at about 1410 out of 1845 
nucleotides (approximately 77% homology) • 

Fig. 6 is a full-iengthr maize optimized Cry IB gene. 

Fig. 7 is a full-lengtii, hybrid^ partially maize 
optijnized DNA sequence of a CrylA(b) gene wliicii is contained in 
PCIB4434. The synthetic region is from nucleotides 1-1938 
(amino acids 1*646), and the native region is from nucleotides 
1939-3468 (amino acids 647-1155) . The fusion point between the 
synthetic and native coding sequences is indicated by a slash 
(/) in the sequence. 

?ig« 8 is a map of pCIB4434« 

Fig. 9 is a full -lengthy hybrid, maize optimized DNA 
sequence encoding a heat stable CrylA(b) protein, contained in 
?CI355il. 

Fig. 10 is a map of pCZ35511. 

Fig. 11 is a full-length, hybrid, maize optimized DNA 
sequence encoding a heat stable CrylA(b) protein, contained in 
PCIBS512. 

Fig. 12 is a map of pCIBSS12. 

Fig* 13 is a full-length, maize optimized DNA sequence 
encoding a heat stable CrylA(b) protein, contained in pCIB5513. 
Fig. 14 is a map of pCI3SS13. 

Fig. 15 is a full-length, maize optimized DNA sequence 
encoding a heat-stable CrylA(b) gene, contained in pCIB5514. 
Fig- 16 is a map of pCiaS514. 
Fig. 17 is a map of pCI34418. 

10 
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Fig. 


18 is a map 


of ?CI34420. 




Fig. 


19 is a map 


of PCIB4429. 




Fig. 


20 is a map 


of PCI34431. 




Fig • 


21 is a map 


of PCIB4428. 




Fig. 


22 is a map 


of PCIB4430. 




Fig. 


23A is a tai 


3le containing dat:< 


i of crylA(b) protein 


n trj 


uisgenic mai: 






Fig. 


23B is a tal 


die which summarize 


£5 results of 



bioassays of Qstrinia and Diatraea on leaf material from maize 
progeny containing a maize optimized CrylA(b) gene. 

Fig. 23C is a table containing data of crylA(b) protein 
levels in transgenic maize. 

Fig. 23D is a table which summarizes the results of 
bioassays of Qstrinia and Diatraea on leaf material from maize 
progeny containing a synthetic 3t. maize gene operably linked 
to a pith promoter. 

Fig. 23E is a table containing data on expression of 
the crylA(b) gene in transgenic maize using the pith-preferred. 



Fig. 24 is a complete genomic DKZl sequence of a maize 
tryptophan synthase- alpha stibtinit gene. IntronSr exons, 
transcription and translation starts, start and stop of cDMA 
are shown • S = start and end of cDNA; +1 « transcription 
start; 73*«****« = primer exrension primer; -rl =» start of 
translation; -k-m- = stop codon; bp 1495-99 = CCAAT 3ox; bp 
1593-1598 =» TA2AA Box; bp 3720-3725 = poly A addition site; 
# above underlined sequences are PGR primers. 

11 
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Figs, 25Ar 2SB, 25C and 25D are Norrn rn biot analyses 
whicii show differential expression of tHe maire TrpA sufaunit 
gene in maize tissue at 2 hour, 4 hour, 18 hoxir, and 48 hoxir 
intervals, respectively, at -aO»C witi DuPont Cronex 
intensifying screens. P-njitH; C=»coi3; aR«brace roots; 3S»ear 
shank; L?«lower pith; MP«^ddle pith; UP»upper pith; S«seed; 
L^leaf; R»root; and P (upper left)=total pith. 

Fig. 26 is a Northern blot analysis, the two left lanes 
of which show the maise TrpA gene expression in the leaf (L) 
and pith (P) of Funk inbred lines 211D and 5N984. The five 
right lanes indicate the absence of expression in Funk 211D 
seed total SNA. S(l, 2,3, 4 and 5)= seed at 1^ 2, 3, 4 and 5 
weeks post pollenation. L-leaf ; P-pith; S#»seed ♦ weeks post 
pollenation. 

Fig. 27 is a Southern blot analysis of genomic DMl Funk 
line 211D, probed with maize TrpA cONA 8-2 (pCIBS600) , wherein 
B denotes BaaHl^ S denotes ScoRI, SV denotes BcoRV, H denotes 
HINDIII, and S denotes Sad. IX, 5X and lOX denote 
reconstructed gene copy equivalents. 

Fig. 2aA is a primer extension analysis which shows the 
transcriptional start of the maize TrpA subxaxit gene and 
sequencing ladder- Lane +1 and +2 are IX + 0.5X samples of 
primer extension reaction. 

Fig. 288 is an analysis of RNase protection from +2 bp 
to -s-387 bp at annealing temperatures of 42*C, 48 •C and S4»Cf at 
a IS hour exposure against film at -80 'C with DuPont Cronex 
intensifying screens. 

12 
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Fig. 29 is A map of the original Type II 
pollen-specific cDNA clone. The siifacloning of th three EcoRI 
fragments into pflluescript vectors to creat pCIB31S8r pCIB31S9 
and II-. 6 is illustrated. 

?ig. 30 shows the DNA sequence of the maize 
pollen-specific calcium dependent protein Icinase gene cDNAr as 
contained in the 1.0 kb and 0.5 kh fragments of the original 
Type II cDHA clone. The ScoRI site that divides the 1-0 Icb and 
0.5 ]cb fragments is indicated. This cDlOL is not full length, 
as the mRIUl start site maps 490 bp upstream of the end of the 
cDNA clone. 

Fig. 31 illustrates the tissue-specific expression of 
the pollen CDPK mfOOl. RNA from the indicated maize 211D 
tissues was denatured, electrophoresed on an agarose gel, 
transferred to nitrocellulose, and probed with the pollen CDPK 
cDNA 0.5 kh fragment. The mRMA is detectable only in the 
pollen, where a strong signal is seen. 

Fig. 32 is an amino acid sequence comparison of the 
pollen CDPK derived protein sequence and the rat protein Icinase 
2 protein sequence disclosed in Tobimatsu et al., J, Biol, 
Chem. 263:16082-16086 (1988) . The Align program of the OHAsrar 
software pacicage was used to evaluate the sequences. The 
homology to protein }cinases occurs in the 5' two thirds of the 
gene, i.e. in the 1.0 ]cb fragment. 

Fig* 33 is an amino acid sequence comparison of the 
pollen C3PX derived protein sequence and the human calmodulin 
protein seauence disclosed in Fischer et al., J. Biol. CThem. 
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253:17055-17062 (1988) . The homology co calmodulin occurs in 
the 3' one tihird of Che gene, i.e. in th 0.5 3cb fragment. 

Fig. 34 is an amino acid sequence comparison of the 
pollen CDPK derived protein sequence and soybean CDPK. The 
homology occurs over the entire gene. 

Fig. 35 illustrates the sequence of the maize 
pollen-specific CDPK gene. 1.4 3cb of sequence prior to the 
mRNA start site is shown. The positions of the seven exons and 
six introns are depicted under the corresponding DNA sequence. 
The site of polyadenylation in the cDSJi clone is indicated. 

Fig. 36 is a map of pCIB4423. 

Fig. 37 is a full-iengthf hybridr maise-optimised DNA 
sequence encoding a heat stable cryIA{b) protein. 

Fig. 38 is a map of pCIB5515. 

Description of the Sequences: 

Sequence 1 is the DNA sequence of a fullrlength native 
3t crylA(b) gene. 

Sequence 2 is the DMSi sequence of a full-length pure 
maize optimized synthetic 3t cryDl(b) gene. 

Sequence 3 is the DMA sequence of an approximately 2 Kb 
truncated synthetic maize optimized 3t crylA(b) gene. 

Sequence 4 is the DNA sequence of a full-length 
synthetic maize optimized 3t crylA(b) gene. 

Sequence 5 is the DMA sequence of an approximately 2 Kb 
synthetic 3t gene according to PerlaJc et al. 

Detailed Descriation of the Invention 

The following definitions are provided in order to 
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provide clarity with resp cr to the terms as tii y are used in 
. rile specification and claims to describe tiie present invention. 

Maize preferred codon; Preferred codon refers to the 
preference exhibited by a specific host cell in the usage of 
nucleotide codons to specify a given amino acid. The preferred 
codon for an amino acid for a particular host is the single 
codon which most frequently encodes that amino acid in that 
host. The maize preferred codon for a particular amino acid 
may be derived from Icnown gene sequences from maize. For 
example^ maize codon usage for 28 genes from maize plants are 
listed in Table 4 of Murray et al.r Nucleic Acids Research , 
17:477-498 (1989), the disclosure of which is incorporated 
herein by reference. For instance^ the maize preferred codon 
for alanine is GCC, since, according to pooled sequences of 26 
maize genes in Murray et al., suora, that codon encodes alanine 
36% of the time, compared to GCG (24%), GCA (13%), and GCTT 
(27%) . 

Pure maize optimized sequence: An optimized gene or DNA 
sequence refers to a gene in which the nucleotide sequence of a 
native gene has been modified in order to utilize preferred 
codons for maize. For example, a synthetic maize optimized Bt 
crylA(b) gene is one wherein the nucleotide sequence of the 
native Bt crylA(b) gene has been modified such that the codons 
used are the maize preferred codons, as described above. A 
pure maize optimized gene is one in which the nucleotide 
sequence comprises 100 percent of the maize preferred codon 
sequences for a particular polypeptide. For example, the pure 
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maize oprimised Bt cryrA(b) gene is one in wiiicii zhe nucleotide 
sequence comprises 100 percent maize preferred codon sequences 
and encodes a polypeptide witii tlie sam amino acid sequence as 
that produced by tlie native 3t crylA(b) gene. The pure 
nucleotide sequence of the optimized gene may be varied to 
permit manipulation of the gene, such as by altering a 
nucleotide ro create or eliminate restriction sites. The pure 
nucleotide sequence of the optimized gene may also be varied to 
eliminate potentially deleterious processing sites r such as 
potential polyadenylation sites or intron recognition sites. 

It is recognized that "partially maize optimized," 
sequences may also be utilized. By partially maize optimized, 
it is meant that the coding region of the gene is a chimeric 
(hybrid) , being comprised of sequences derived from a native 
insecticidal gene and sequences which have been optimized for 
expression in maize. A partially optimized gene expresses the 
insecticidal protein at a level sufficient to control insect 
pests r and such expression is at a higher level than achieved 
using native secpiences only. Partially maize optimized 
sequences include those which contain at least about 5% 
optimized sequences. 

Full -length 3t Genes: Refers to DNA sequences 
comprising the full nucleotide sequence necessary to encode the 
polypeptide produced by a native 3t gene. For example, the 
native 3t crylACb) gene is approximately 3. S Kb in length and 
encodes a polypeptide which is approximately 1150 amino acids 
in length. A full-length synthetic crylACb) 3t gene would be 
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ar leasr approximately 3.5 Kb in lengrJ::. 

Truncarad 3t Genes: Refers zo DNA sequences comprising 
less than zhe full nudeoride sequence necessary to encode the 
polypeptide produced by a native 3t gene^ but whicii encodes ztiB 
active toxin portion of zhB polypeptide. ?or example ^ a 
truncated synthetic 3t gene of approximately 1.9 Kb encodes tbe 
active toxin portion of the polypeptide such that the protein 
product exhibits insecticidal activity. 

Tissue-oref erred promoter : The term "tissue- 
preferred promoter"* is used to indicate that a given regulatory 
DNA sequence will promote a higher level of transcription of an 
associated structural gene or DMA coding sequence , or of 
expression of the product of the associated- gene as indicated 
by any conventional RKA or protein assay / or that a given DNA 
sequence will demonstrate some differential effect; i.e.^ that 
the transcription of the associated DNA sequences or the 
expression of a gene product is greater in some tissue than in 
all other tissues of the plant. 

"Tissue-specific promoter'* is used to indicate that a 
given regulatory DNA sequence will promote transcription of an 
associated coding DNA sequence essentially entirely in one or 
more tissues of a plant, or in one type of tissue, e.g. green 
tissue, while essentially no transcription of that associated 
coding DNA sequence will occur in all other tissues or types of 
tissues of the plant. 

The present invention provides DNA sequences optimized 
for expression in plants, especially in maize plants. In a 
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preferred embodimenr ox riie present invention, tlxe DNA 
sequences encode the production of an insecticidal toxin, 
preferably a polypeptide sharing substantially the amino acid 
sequence of an insecticidal crystal protein toxin normally 
produced by Bacillus tburinaiensis > The synthetic gene may 
encode a truncated or full-length insecticidal protein. 
Sspecially preferred are synthetic 3NA sequences which encode a 
polypeptide effective against insects of the order Lepidootera 
and ColeoPtera i> and synthetic DN21 sequences which encode a 
polypeptide having an amino acid sequence essentially the same 
as one of the crystal protein toxins of aacillus thurinqiensis 
variety 3airstaJci, HD-*1. 

The present invention provides synthetic Did sequences 
effective to yield high expression of active insecticidal 
proteins in planar preferably maise protoplasts, plant cells 
and plants. The synthetic DMA sequences of the present 
invention have been modified to resemble a maize gene in terms 
of codon usage and G+C content. As a result of these 
modifications, the synthetic DNA sequences of the present 
invention do not contain the potential processing sites which 
are present: in the native gene. The resulting synthetic DNA 
sequences (synthetic 3t I? coding sequences) and plant 
transf o rm a t: on vectors containing this synthetic DNA sequence 
(synthetic 3t IP genes) result in surprisingly increased 
expression of the synthetic at IP gene, compared to the native 
at I? gene, in terms of insecticidal protein production in 
plants, particularly maize. The high level of expression 
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results in maize cells and pianrs tiiar exfaibir resisrance ro 
lepidopreran insecrs, preferaisly Surapean Cam 3orer and 
Diarrea saceharalis > the Sugarcane 3orer. 

The syntlietic DNA sequences of tiie present invention 
are designed to encode insecticidal proteins from Bacillus 
thurinciensis , but are optimised for expression in maize in 
terms of G-fC content and codon usage. For exastple^ the maize 
codon usage table described in Murray et al.^ sugra, is used to 
reverse translate the amino acid sequence of the toxin produced 
by the Bacillus thurinciensis subsp. kurstaiti HD-1 crylA(b) 
gene, using only the most preferred maize codcns. The reverse 
translated DNA sequence is referred to as the pure maize 
optimized sequence and is shown as Sequence 4. This sequence 
is subsequently modified to eliminate unwanted restriction 
endonuclease sites r and to create desired restriction 
endonudease sites. These modifications are designed to 
facilitate cloning of the gene without appreciably altering the 
codon usage or the maize optimized sequence. During the 
cloning procedure^ in order to facilitate cloning of the gene, 
other modifications are made in a region that appears 
especially susceptible to errors induced during cloning by the 
polymerase chain reaction (PGR) . The final sequence of the 
maize optimized synthetic 3t IP gene is shown in Sequence 2. A 
comparision of the maize optimized synthetic 3t IP gene with 
the native jcurstaici cryIA<b) 3t gene is shown in Fig. 1. 

In a preferred embodiment of the present invention, the 
protein produced by the synthetic DNA sequence is effective 
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againsc insecrs of uhe order Leoidoorera or Coleotycera , In a 
more preferred ambodimem:, the polypeptide encoded by the 
synrh ric D13A sequence consists essentially of the full-length 
or a truncated amino acid sequence of an insecticidal proredLn 
normally produced by aacillus thurinaiensis var. kurstaki 
In a particular embodiment ^ the synthetic DKA. sequence encodes 
a polypeptide consisting essentially of a truncated amino acid 
sequence of the Bt CrylA(b) protein. 

The insecticidal proteins of the invention are 
expressed in a plant in an amount sufficient to control insect 
pests r i.e. insect controlling amounts. It is recognized that 
the amount of expression of insecticidal protein in a plant 
necessary to concrol insects may vary depending upon species of 
plants type of insect r environmental factors and the like. 
Generally, the insect population will be kept below the 
economic threshold which varies from plant to plant. ?Qr 
example r to control Suropean com borer in maize, the economic 
threshold is .5 eggmass/plant which translates to about 10 
larvae/plant . 

The methods of the invention are useful for controlling 
a wide variety of insects including but not limited to 
rootworms, cutworms ^ armyworasr particularly fall and beet 
araywormsr wireworms, aphids, com borers, particularly 
Suropean com borers, sugarcane borer, lesser com stalk borer, 
Southwestern com borer, etc. 

In a preferred embodiment of the present invention, the 
synthetic coding DNA sequence optimized for expression in 
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maize cQxapxLses a G+C percencage greater zhan ciiar of tiie 
nacive crylA(b) gene* It is preferred zhaz zhe G+C percentage 
be at least about 50 percent f and .nors preferably at 1 ast 
about 60 percent. It is especially preferred Zhaz zhe G-k: 
percent be about 64 percent. 

In another preferred embodiment of tbe present 
invention^ the synthetic coding DMA sequence optimised for 
expression in maize comprises a nucleotide sequence having at 
least about 90 percent homology with the "pure" maize optimized 
nucleotide sequence of the native Bacillus thurinaiensis 
crylA(b) protein, more preferably at least about 95 percent 
homology r and most preferably at least about 98 percent. 

Other preferred embodiments of the present invention 
include synthetic DMA sequences having essentially the DMA 
sequence of Sequence ID No. 4, as well as mutants or variants 
thereof; transformation vectors comprising essentially the DMA 
sequence of Sequence ID No. 4; and isoiatea DMA sequences 
derived from the plasmids pC134406, pCI34407, pCIB4413, 

pCia4414, PCIB4416, pCIB4417, pCIB4418, ?CIB4419, ?CIB4420r 
PCIB4421, PCIB4423, pCI34434, pCI34429, pCIB4431, pCIB4433. 
Most preferred are isolated DMA sequences derived from the 
plasmids pClB4418 and pCia4420, pCIB4434, ?CIB4429, pCia4431, 
and PCIB4433. 

In order to construct one of the maize optimized DMA 
sequences of the present invention, synthetic DMA 
oligonucleotides are made with an average length of about 30 
nucleotides. These oligonucleotides are designed to hybridize 
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zo produce fragmenrs comprising' zhe various quarrers of zhe 
trancated toxin gene. Tlie oligonucleotides for a given quarter 
are hybridized and amplified using PGR. The quarters are then 
cloned and the cloned quarters are sequenced to find those 
containing the desired sequences. In one instance , the fourth 
quarter^ the hybridized oligonucleotides are cloned directly 
without PGR amplification. Once all clones of four quarters 
are found which contain open reading frames, cui intact gene 
encoding the active insecticidal protein is assembled. The 
assembled gene may then be tested for insecticidal activity 
against any insect of interest including the Suropean Com 
Borer (EC3} and the sugarcane borer. (Examples 5A and 5B, 
respectively) . nhen a fully functional gene is obtained, it is 
again sequenced to confirm its primary structure. The fully 
functional gene is found to give 100% mortality when bioassayed 
against SCa. The fully functional gene is also modified for 
expression in maize. 

The maize optimized gene is tested in a transient 
expression assay ^ e.g. a maize transient expression assay. 
The native 3t crylA(b) coding sequence for the active 
insecticidal toxin is not expressed at a detectable level in a 
maize transient expression system. Thus, the level of 
expression of the synthesized gene can be determined. By the 
present methods, expression of a protein in a transformed plant 
can be increased at least about 100 fold to about 50,000 fold, 
more specifically at least about 1,000 fold to at least about 
20,000 fold. 
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Increasing expression of an insecricial gene to an 
effecrive level does nor require 3ian^pulacion of a narive gene 
along ti^e enrire sequence. Sffecrive expression can be 
achieved by manipulating only a porcion of the sequences 
necessary to obtain increased expression. A full-lengrhr maize 
optimized CrylA(b) gene may be prepared which contains a 
protein of the native CrylA(b) sequence. For example. Figure 7 
illustrates a full*lengthr maize optimized CrylA(b) gene which 
is a synthetic-native hybrid. That is, about 21cb of the gene 
(nucleotides 1-1938) is maize optimized, i.e. synthetic. The 
remainder, C-terminal nucleotides 547-1155, are identical to 
the corresponding sequence native of the CrylA(b) gene. 
Construction of the illustrated gene is described in Sxample 6, 
below . 

It is recognized that by using the methods described 
herein, a variety of synthetic/native hybrids may be 
constructed and tested for expression. The important aspect of 
hybrid construction is that the protein is produced in 
sufficient amounts to control insect pests. In this manner, 
critical regions of the gene may be id€&ntified and such regions 
synthesized using preferred codons. The synthetic sequences 
can be linked with native sequences as demonstrated in the 
Sxaaples below. Generally, N-cerminal portions or processing 
sites can be synthesized and substituted in the native coding 
sequence for enhanced expression in plants. 

In another embodiment of the present invention, the 
maize optimized genes encoding crylA(b) protein may be 
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manipulated zo render zhe encoded ororein more iieat stable or 
temperature stable coopared to th native cryIA{b) protein. It 
has been sbown tbat tiie crylA(b) gene found in Bacillus 
tliurincriensis kurstaJci HD-1 contains a 26 amino acid deletionr 
wben compared with tbe crylA(a) and cryIA(c> proteins, in the 
-COOH half of tbe protein. This deletion leads to a 
temperature-sensitive crylA(b) protein. See M. Geiser, SP 0 
440 581, entitled "Temperaturstabiles Bacillus 
tburincTiensis- Toxin" . Repair of this deletion with the 
corresponding region from tbe crylA(a) or cryIA(c> protein 
improves tbe temperature stability of the repaired protein. 
Constructs of the full-length modified cryl&(b) synthetic gene 
are designed to insert sequences coding for the missing amino 
acids at the appropriate place in the sequence without alt erin g 
the reading frame and without changing the rest of the protein 
sequence. The full-length synthetic version of the gene is 
assembled by synthesizing a series of double-'Stranded DtQl 
cassettes, each approximately 300 bp in size, using sta nd a r d 
techniques of DMA. synthesis and enzymatic reactions. The 
repaired gene is said to encode a "heat stable* or 
"teiroerature-stable" cryIA{b) protein, since it retains more 
biological activity than its native counterpart when exposed to 
high temperatures. Specific sequences of maize optimized, heat 
stable cryl^(b) genes encoding temperature stable proteins are 
set forth in Figs. 9, 11, 13 r and 15, and are also described in 
Sxample 7, below. 

The present invention encompasses maize optimized 
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coding sequences encoding other polypeprides, including those 
of other Bacillus rhurinaiensis insecticidai polypeptides or 
insecticidal proteins from other sources. For exastpler cry IB 
genes can be maize optimized, and then stably introduced into 
plants r particularly maize. The sequence of a maize optimized 
crylB gene constructed in accordance with the present invention 
is set forth in Fig. 6. 

Optimizing a 3t I? gene for expression in maize using 
the maize preferred codon usage according to the present 
invention results in a significant increase in the expression 
of the insecticidal gene. It is anticipated that other genes 
can be synthesized using plant codon preferences to improve 
their expression in maize or other plants. Use of mai.ze codon 
preference is a likely method of optimizing and maximizing 
expression of foreign genes in maize. Such genes include genes 
used as selectable or scoreable markers in maize 
transformation, genes which confer herbicide resistance, genes 
which confer disease resistance, and other gcmes which confer 
insect resistance. 

The synthetic crylACb) gene is also inserted into 
Aorobacterium vectors which are useful for transformation of a 
large variety of dicotyledenous plant species. (Example 44) . 
Plants stably transformed with the synthetic crylA(b) 
Aarobactarium vectors exhibit insecticidal activity. 

The native Bt crylA(b) gene is quite A+T rich. The G+C 
content of the full-length native 3t crylA(b) gene is 
approximately 39%. The G+C content of a truncated native Bt 
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crylA(b) gene of about 2 Kb in lengtii is approximacsly 37%. in 
general, maize coding regions tend ro be predomincuitly G<rC 
ricii* The modificatiions made zo tiae 3t crylA(b) gene result in 
a syntiietic IP coding region whicb has greater than 50% G+C 
content r and has about S5% homology at the 0M21 level with the 
native crylA(b) gene. The protein encoded by this synthetic 
CrylA(b) gene is 100% homologous with the native protein^ and 
thus retains full function in terms of insect activity. The 
truncated synthetic CrylA(b) IP gene is about 2 Kb in length 
and the gene encodes the active toxin region of the native 3t 
}curstaJci CrylA(b) insecticidal protein. The length of the 
protein encoded by the truncated synthetic CrylA(b) gene is 648 
amino acids. 

The synthetic genes of the present invention are useful 
for enhanced expression in transgenic plants, most preferably 
in transformed maize. The transgenic plants of the present 
invention may be used to express the insecticidal CrylA(b) 
protein at a high level , resulting in resistance to insect 
pests f preferably coleopteran or lepidopteran insects, and most 
preferably European Com 3orer (EC3) and Sugarcane 3orer. 

In the present invention, the DNA coding sequence of 
the synthetic maize optimized gene may be under the control of 
regulatory elem ent s such as promoters which direct expression 
of the coding sequence. Such regulatory elements , for example, 
include monocot or maize and other monocot functional promoters 
to provide expression of the gene in vaurious parts of the maize 
plant. The regulatory element may be constitutive. That is, 
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it: niay promote conrinuous and srable expression of zhs gene. 
Such aromorers include bur are nor 1:, mired to tiie CaMV 25S 
promorar; riie CaMV 19S promorer; tumefaci ens promorers sucii 
as ocropine synthase promorers, mannopine synr^se promorers, 
nopaline synriiase promorers, or orher opine synthase promorers; 
ubiquirin promorers r acrin promorers, hisrone promorers and 
tubulin promoters. The regularory element may be a 
tissue-preferential promoter, that is, it may promote higher 
expression in some tissues of a planr than in others. 
Preferably, the tissue-preferential promorer may direct higher 
expression of the synrheric gene in leaves, stems, roots and/ or 
pollen than in seed« The regulatory element may also be 
inducible, such as by hear srress, water stress r insect feeding 
or chemical induction, or may be developmenrally regulated. 
Numerous promoters whose expression are Jcnown to vary in a 
tissue specific manner are }cnown in the art. One such example 
is the maise phosphoenol pyruvate carboxylase (PEPC) , which is 
green tissue-specific. See , for example, Hudspeth, R.L. and 
Grula, J.»., Plant Molecular Bioloav 12: 575-58 9 > 1989). Other 
green tissue-specific promoters include chlorophyll a/b binding 
protein promoters and RubisCO small sxibunit promoters. 

The present invenrion also provides isolated and 
purified pith-preferred promoters. Preferred pith-preferred 
promorers are isolared from graminaceous monocors such as 
sugarcane, rice, whear, sorghum, barley, rye and maize; more 
preferred are those isolated from maize plants. 

In a preferred embodiment, the pirh-pref erred promorer 
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is isoiared from a planr TrpA gene; in a most preferred 
embodimenrr it is isolated from a maise TrpA gene. That is, 
the promoter in its native state is operativeiy associated with 
a maize tryptophan synthase-alpha siihxinit gene (hereinafter 
"TrpA") . The encoded protein has a molecular mass of about 
38)cD. Together with another alpha subnit and two beta 
subunits, TrpA forms a multimeric ensyme^ tryptophan synthase. 
Each subunit can operate separately, but they function more 
efficiently together. TrpA catalyzes the conversion of indole 
glycerol phosphate to indole. Neither the maize TrpA gene nor 
the encoded protein had been isolated from any plant before 
Applicants' invention. The Arabidoosis thaliana tryptophan 
synthase beta subunit gene has been cloned sls described Wright 
et al.r The Plant Call , 4:711-719 (1992). The instant maize 
TrpA gene has no homology to the beta subunit encoding gene. 

The present invention also provides purified 
pollen-specific promoters obtainable from a plant 
calcium-dependent phosphate )cinase (CDPK) gene. That is, in 
its native state^ the promoter is operafaly linked to a plant 
CTPK gene. In a preferred embodiment, the promoter is isolated 
from a maize CDPK gene. 3y "pollen-specif iCr " it is meant that 
the expression of an operatively associated structural gene of 
interest is substantially exclusively (i.e. essentially 
entirely) in the pollen of a plant , and is negligible in all 
other plant parrs . By "CDPK, " it is meant a plant protein 
}cinase which has a high affinity for calcium, but not 
calmodulin, and requires calcium, but not calmodulin, for its 
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caralyric acrivity. 

To obtain tissue-preferred or tissue specific 
promoters r genes encoding tissue specific messenger RNA (mRNA) 
can be obtained by differential screening of a cDNA library. 
For example r a pith-preferred cDNA can be obtained by 
subjecting a pitb cDNA library to differential screening using 
cDNA probes obtained from pitii and seed mRNA. See, Molecular 
Cloning, A Laboratory Manual, Sambrook et al. eds. Cold Spring 
Harbor Press: New York (1989). 

Alternately, tissue specific promoters may be obtained 
by obtaining tissue specific proteins r sequencing the 
N-terminus, synthesizing oligonucleotide probes and using the 
probes to screen a cONA library. Such procedures are 
exemplified in the Experimental section for the isolation of a 
pollen specific promoter. 

The scope of the present invention in regard to the 
pith-pref erred and pollen-specific promoters encompa s ses 

functionally active fragments of a full-length promoter that 

> 

also are able to direct pith-preferred or pollen-specific 
transcription, respectively, of associated structural genes. 
Functionally active fragments of a promoter DNA sequence may be 
derived from a promoter DNA sequence, by several art-recognized 
procedures, such as, for example, by cleaving the promoter DNA 
sequence using restriction enzymes r synthesizing in accordance 
with the sequence of the promoter DNA sequence, or may be 
obtained through the use of ?CR technology. See , e.g. Mollis 
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TBchnoloqy, Stociccon Press (New ^otk 1989) . 

Furrlier included witiiiin tiiie scope of tiie instant 
invention are pith-preferred and pollen-specific promoters 
. "equivalent" to the full-length promoters. That is, different 
nucleotides, or groups of nucleotides may be modifiedr added or 
deleted in a manner that does not abolish promoter activity in 
accordance with known procedures. 

A pith-preferred promoter obtained from a maize TrpA 
gene is shown in Fig* 24. Those skilled in the art, with this 
sequence information in hand, will recognize that 
pith-preferred promoters included within the scope of the 
present invention can be obtained from other plants by probing 
pith libraries from these plants with probes derived from the 
maize TrpA structural gene. Probes designed from sequences 
that are highly conserved among TrpA sufaunit genes of various 
species r as dismissed generally in Sxairole 17, are preferred. 
Other pollen-specific promoters r which in their native state 
are linked' to plant CDPK genes other than maize, can be 
isolated in similar fashion using probes derived from the 
conserved regions of the maize CDPK gene to probe pollen 
libraries • 

In another embodiment of the present invention, the 
pith-preferred or pollen-specific promoter is operably linked 
to a DNA sequence, i.e. structural gene, encoding a protein of 
interest, to form a recombinant DNA molecule or chimeric gene. 
The phrase "operably linked to" has an art-recognized meaning; 
it may be used interchangeably with "operatively associated 

30 



wo 93/07278 PCr/US92/08476 

wirhf '•"linked ro, " or "fused ro". 

The structural gene may b homologous or heterologous 
with respect to origin of the promoter and/or a target plant 
into which it is transformed • Regardless of relative origin , 
the associated DNA sequence will be expressed in the 
transformed plant in accordance with the expression properties 
of the promoter to which it is linked. Thus, the choice of 
associated DNA sequence should flow from a desire to have the 
sequence expressed in this fashion. Examples of heterologous 
DNA sequences include those which encode insecticidal proteins, 
e.g. proteins or polypeptides toxic or inhibitory to insects or 
other plant parasitic arthropods, or plant pathogens such as 
fungi, bacteria and nematodes. These heterologous DNA 
sequences encode proteins such as magainins, Zasloff , PNAS USA, 
84:5449-5453 (1987); cecropins, HuitaarJc et ai., Sur. J. 
Bicchem. 127:207-217 (1982); attacins, Huitaaaric et al., SMBO J. 
2:571-576 (1983); meiitria, gramicidin S, Katsu et al., 
3iochem. Biophvs. Acta . 939:57-53 (1988); sodium ciiannel 
proteins and synthetic fragments, Oilci et al. PKRS aSA . 
85:2395-2397 (1988); tile alpha toxin of Stapfavloeoegas aureusm 
Tobkes et al., Bioefaem. . 24:1915-1920 (1985); apolipoprotains 
and fragments tliereof, Knott et al., Science 230:37 (1985); 
Nakagawa et al., J. Am. Ciiem. Soc . 107:7087 (1985); 
alamethicin and a variety of synthetic amphipathic peptides. 
Kaiser et al., Ann. Rev. 3iophvs. aiopfavsl Chem. 16:561-581 
(1987); lectins. Lis et al., Ann. Rev. Biochem. . 55:35-68 
(1986); protease and amylase inhibitors; and insecticidal 
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proteins from Bacillus r.^uringiensiS r parricuiariy tiie 
delta-endoroxins from 3^ t^jiurinciensis ; and from oriier bacreria 

or fungi. 

In a preferred embodimenr of tJie invention, a 
oitli-oref erred oromoter obtained from a maize TrpA subunit gene 
or pollen-specific promoter obtained from a maize CDPK gene is 
operably linked to a heterologous DNA sequence encoding a 
Bacillus tiiurinaiensis ("B.t.") insecticidal protein. These 
proteins and the corresponding structural genes are well Icnown 
in tbe art. See. Hofte and Whiteley, Microbiol - Reviews, 
53:242-255 (1989) . 

Wliile it is recognized that any promoter capable of 
directing expression can be utilized, it may be preferable to 
use heterologous promoters rather than the native promoter of 
the protein of interest. In this manner^ chimeric nucleotide 
sequences can be constructed which can be determined based on 
the plant to be transformed as well as the insect pest. For 
example, to control insect pests in maize, a monocot or maize 
promoter can be operably linked to a Bt protein. The maize 
promoter can be selected from tissue-preferred and tissue- 
specific promoters such as pith-preferred and pollen-specific 
promoters, respectively as disclosed herein. 

In some instances ^ it may be preferred to transform the 
plant cell with more than one chimeric gene construct. Thus, 
for example, a single plant coxild be transformed with a 
pith-preferred promoter operably linked to a at protein as well 
as a pollen-specific promoter operably linked to a Bt protein. 
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The transformed planrs would express 3t proteins in zhe. pianr 
pirh and pollen and to a lesser extent the roots r oucer shearh 
and brace roots. 

For various orher reasons r parricularly management of 
potenrial insect resistance developing to planr expressed 
insecticidal proteins r it is beneficial to express more than 
one insecticidal prorein (IP) in the same plant • One could 
express two different genes (such as two different Bacillus 
thurinaiensis derived delta-endotoxins which bind different 
receptors in the target insect's midgut) in the same tissues , 
or one can selectively express the two toxins in different 
tissues of the same plant using tissue specific promoters. 
Expressing two Bt genes (or any two insect icidal genes) in the 
same plant using three different tissue specific promoters 
presents a problem for production of a plant expressing the 
desired phenotype. Three different promoters driving two 
different genes yields six different insecticidai genes that 
need to be introduced into the plant at the same time. Also 
needed for the transformation is a selectable marxer to aid in 
identification of transformed plants. This means introducing 
seven diffearent genes into the plant at the same time. It is 
most desired that all genes, especially zhe insecticidai genes r 
integrate into the plant genome at the same locus so they will 
behave as a single gene trait and not as a multiple gene trait 
that will be harder to tracJc during breeding of commercial 
hybrids. The total number of genes can be reduced by using 
differential tissue specific expression of the different 
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insecricidal proreins. 

For exampier by fusing crylA(b) with zhe pollen and PEP 
carboxylase promoters, one would obtain expression of this gen 
in green tissues and pollen- Fusing a pith-preferred promoter 
with the crylB delta endotoxin from Bacillus thurinciensis 
would produce expression of this insecticidal protein most 
abundantly in the pith of a transformed plant, but not in seed 
tissues. Transformation of a plant with three genes, PEP 
carboxylase/cryIA(b) , pollen/cryIA(b) , and pith/cry IB produces 
a plant expressing two different 3t insecticidal endotoxins in 
different tissues of the same plant. CrylA(b) would be 
expressed in the "outside" tissues of a plant (particularly 
maize) , that is, in those tissues which European com borer 
feeds on first after hatching. Should SC3 prove resistant to 
crylA(b) and be able to burrow into the stalk of the plant 
after feeding on leaf tissue and/ or pollen, it would then 
encounter the crylB delta-endotoxin and be exposed to a second 
insecticidal coa^onent. In this manner, one can differentially 
express two different insecticdal components in the same plant 
and decrease the total number of genes necessary to introduce 
as a single genetic unit while at the same time providing 
protection against development of resistance to a single 
insecticidal component. 

Likewise, a plant may be transformed with constructs 
encoding more than one type of insecticidal protein to control 
various insects. Thus, a number of vairiations may be 
constructed by one of skill in the art. 
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Tlie recombinanr DNA aoiecuies of the invention may be 
prepared by manipulating the various elements to place them in 
proper orientation. ThuSr adapters or linkers may be employed 
to join the DMA fragments. Other manipulations may be 
performed to provide for convenient restriction sites, removal 
of restriction sites or superfluous DNA. These manipulations 
can be performed by art-recognized methods. See , SambrooJc et 
al . , Molecular Cloning: A Laboratory Manual , Cold Spring 
Harbor Laboratory ?ress, second edition, 19B9. For example, 
methods such as restriction, chewing bacic or filling in 
overhangs to provide blunt ends, ligation of linkers, 
complementary ends of the DNA fragments can be provided for 
joining and ligation* See , Sambrook et al., supra. 

Other functional DIXA sequences may be included in the 
recombinant DNA molecule, depending upon the way in which the 
molecule is to be incorporated into the target plant genome. 
For instance, in the case of Aarobact er ium- mediat ed 
transformation, if Ti- or the Hi- plasmid is used to transform 
the plant cells, the right and left borders of the T-OMA of the 
Ti- and Ri« plasmid will be joined as flanking regions to the 
expression cassette. Acrobactsrium tumefaciens ^mediated 
transformation of plants has been described in Horsch et al., 
Science , 225:1229 (1985); Marton, Call Culture Somatic Cell 
Genetics of Plants , 1:514-521 (1984) ; Hoekema, In: The Binary 
Plant Vector Svstem Of fset-Orukkeri j Kanters 3.V., 
Alblasserdam, 1985, Chapter V Fraley, et al., Crit . Rev. Plant 
Sci., 4:1-46; and An et al., SMBO J. . 4:277-284 {1985). 
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The recombinant DNA aoiecuies of the invention also can 
include a marker gene to facilitate sel ction in recombinant 
olant cells. Sxamoles of maricers include resistance to a 
biocide such as an antibiotic r e.g. Icanamycinr hygromycinr 
chloramphenicol^ paramomycin^ methotrexate and bleomycinr or a 
herbicide such as imidazolones^ sulfonylureas ^ glypfaosater 
phosphinothricin, or bialaphos. Marker genes are well Jcno*m in 
the art. 

In another embodiment of the present inventionr 
plants stably transformed with a recombinant DMA molecule or 
chimeric gene as described hereinabove are provided. The 
resultant transgenic plant contains the transformed gene stably 
incorporated into its genome , and will express the structural 
gene operably associated to the promoter in the respective 
fashion. 

Transgenic plants encompassed by the instant inve n ti on 
include both monocots and dicots. Representative examples 
include maizer tobaccor tomato r cotton^ rape seedr soybean, 
wheats rice, alfalfa, potato and' sunflower . [others?]. 
Preferred plants include maiz&r particularly inbred maize 
plants . 

All transformed plants encompassed by the instant 
invention may be prepared by several m eth ods known in the art. 
A. tumef acien3 *me<^^ at:>*H transformation has been disclosed 
above. Other methods include direct gene transfer into 
protoplasts, Paszkowski et al., SMBO J,, 12:2717 (1984); Loerz 
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Nar-are 319:719 (1986); micropro jecrile bombarctoenr, Klein ec 
al,, 3io/?echnoloqv , 6:559-5S3 (1988); inj ction into 
protioplascSf cultured cells and tissues, Reicii et al., 
Bio/Technology , 4:1001-1004 (1986); or injection into 
meristematic tissues or seedlings and plants as described by De 
La Pena et ai., Nature , 325:274-276 (1987); Graves et al,, 
Plant Mol. Biol, > 7:43-50 (1986); Hooykaas-Van Slogteren et 
al., Nature , 311:763-764 (1984); Griasley et ai,, 
Bio/Technology . 5:185 (1988); and Griasley et al., Nature ^ 
325:177 (1988); and electroporation, WO92/09696. 

The expression pattern of a structural gene operatively 
associated with an instant tissue-preferred or tissue-specific 
promoter in a transformed plant containing the same is critical 
in the case where the structural gene encodes an insecticidal 
protein. For example, the instantly disclosed pith-preferred 
expression pattern will allow the transgenic plant to tolerate 
and withstand pathogens and herbivores that attack primarily 
the pith, but also the brace roots, outer sheath and leaves of 
the plant since the protein will be expressed to a leaser 
extent but still in an insect controlling amount in these plant 
parts, but yet in the case of both types of promoters, will 
leave the seed of the plant unaffected. 

SX&MPLZS 

The following examples further describe the materials 
and methods used in carrying out the invention. They are 
offered by way of illustration, and not by way of limitation- 
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SXAMPLS 1 r General Metnods 

DNA manipuiarions were done using procedures zhaz are 
standard in t:he art. These procedures can often be modified 
and/ or substituted without substantively changing the result • 
Sxcept where other references are identified^ most of these 
procedures are described in Sambrook et al., Molecular Cloning: 
A Laboratory Manual , Cold Spring Harbor Laboratory Press r 
second edition, 1989. 

Synthesis of ONA oligomers: 

DNA oligomers which are from about twenty to about 
ninety, preferably from about sixty to about eighty nucleotides 
in length, are synthesized using an Applied Biosysteos 
model 380B OllA synthesizer and standard procedures. The 
oligomers are made using the updated SSCAF3 cycle on a 0.2 
umoler wide pore, small scale ABI column. The end procedure is 
run trityl off and the oligomer is cleaved from the column 
using the 380B's automatic cleavage cycle. The oligomers are 
then deblocJced in excess ammonium hydroxide (MH^OH) at 55 *C for 
8** 12 hours. The oligomers are then dried in an evaporator 
using nitrogen gas. After completion, the oligomers are 
resuspended in 0.25 - 0.5 ml of deionized water. 

Purification of synthetic oligomers: 

An aliquot of each oligomer is mixed with an ecmal 
volume of blue dye\f oraamide mix with the final solution 
containing 0.05% bromophenol blue, 0.05% xylene cyanol FF, and 
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2S % f ormaaide . This aiixrure is heated at 9S»C for 10 ainuces 
to denature tixe oligomers, Sainpies are ttien applied to a 12 % 
polyacrylamide-urea gel containing 7 M urea (Sambrooic et al.) . 
After electrophoresis at 300-400 volts for 3-4 hours using a 
Vertical Slab Gel Unit (Hoefer Scientific Instruments r San 
Francisco, CA) , UV shadowing is used to locate the correct 
sized fragment in the gel which was then excised using a razor 
blade. The purified gel fragment is minced and incubated in 
0.4 M LiCl, 1 aM EDTA (pH 8) buffer overnight at 37»C, 

Either of two methods is used to separate the oligomers 
from the polyacrylamide gel remnants: Gane\X 25 uM porous 
polyethylene filter units or Millipore's ultrafree-MC 0*45 yM 
filter units. The purified oligomers are ethanol precipitatedr 
recovered by centrifuging in a microfuge for 20 min at 4*C, and 
finally resuspended in TE (10 mM Tris, 1 mM SDTA, pH 8.0) « 
Concentrations are adjusted to 50 ng\ul based on absorption 
readings at 2 SO nm« 

Kinasing oligomers for size determinations: 

To check the size of some of the oligomers on a 
sequencing gel, Icinase labeling reactions are carried out using 
purified synthetic oligomers of each representative size: 
40mers, SOmers, 70mersr SOmerSr and 90mers. In each 20 ul 
]cinasing reaction, one pmole of purified oligomer is used in a 
buffer of 7.0 aM Tris pfl 7.5, 10 mM KCl, 1 mM MgC12) , 0.5 mM 
DTT, 50 ug/ml BSA, 3000 uCi (3 pmoles) of 32P-gammaATP, and 
3 units of T4 polynucleotide kinase. The }cinase reaction is 
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incubated for 1 hour at 37 •Cr followed by a phenol \ciiioroform 
extraction and three ethanol precipitations with glycogen as 
carrier (Tracy, Preo. Biochem, 11:251-268 (1981) . 

Two gel loadings (one containing 1000 cpm^ the other 
containing 2000 cpm) of each reaction are prepared with 25% 
formamide, 0.05% bromophenol blue, and 0.05% xylene cyanol FF. 
The }cinased oligomers are boiled for 5 minutes before loading 
on a S % polyacrylamide^ 7 M urea sequencing gel (BRL Gel 
Mix TM6r 3HLf Gaithersbxirg, MD) . A sequencing reaction of 
plasoid pDClS is run on the same gel to provide size marJcers. 

After electrophoresis, the gel is dried and exposed to 
diagnostic X-ray film (Kodak, X-OMAT AR) . The resulting 
autoradiograph shows all purified oligomers tested to be of the 
correct size. Oligomers which had not been sized directly on 
the sequencing gel are run on a S % polyacrylamide, 7 M urea 
gel (BRL Gel Mix TH6) , using the sized oligomers as size 
markers. All oligomers are denatured first with 25 % formamide 
at lOO^C for 5 minutes before loading on the gel. Sthidium 
bromide staining of the polyacrylamide gel allows all the 
oligomers to be visualized for size determination . 

Hybridizing oligomers for direct cloning: 

Oligomers to be hybridized are pooled together (from 1 
pg to 20 ug total DMA) and kinased at 37 •C for 1 hour in IX 
Promega ligation buffer containing 30 raM Tris-HCl pH 7.8, 10 mM 
MgC12, 10 mM DTT, and 1 mM dATP. One to 20 units of T4 
polynucleotide kinase is used in the reaction, depending on the 
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amounc of roral DNA present. The kinasing reacrions are 
sropped by placing zhe reacrion in a boiling warer barb for 
five minuras. Oligomers to form the 5' termini of the 
hybridized molecules are not Icinased but are added to the 
kinased oligomers along with additional hybridization buffer 
after heating. The poolea oligomers are in a volume of 50-100 
ul with added hybridization buffer used to adjust the final 
salt conditions to 100 mM NaCl, 120 mM Tris pH 7.5, and 10 mM 
MgC12. The }cinased and non-jcinased oligomers are pooled 
together and heated in a boiling water bath for five minutes 
and allowed to slowly cool to room temperature over a period of 
about four hours. The hybridized oligomers are then 
phenol \ chloroform extracted, ethanol precipitated, and 
resuspended in 17 ul of TE {10 mM Tris, 1 mM SDTA, pfl S.O) . 
Using this 17 uir a ligation reaction with a final volume of 20 
ul is assembled (final conditions = 30 mM Tris-HCl pfl 7,8, 10 
mM MgC12r 10 mM DTT, 1 mM ATP, and 3 units of T4 ONA ligase 
(Promega, Madison Wl) . The ligation is allowed to incubate for 
about 2 hours at room temperature. The hybridizedXligated 
fragments are generally purified on 2% Nusieve gels before 
andXor after cutting with restriction enzymes prior to cloning 
into vectors. A 20 ul volume ligation reaction is assembled 
using 100 ng to 500 ng of each fragment with approximate 
equimolar amounts of DNA in 30 mM Tris-HCl pH 7.8, 10 mM MgC12, 
10 mM DTT, 1 mM ATP, and 3 units of T4 DNA ligase (Promega, 
Madison, WI) . Ligations are incubated at room temperature for 
2 hours. After ligation, DNA is transformed into frozen 
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contperenr coli cells using srandard procedures (SambrooJc et: 
al.) and ^ansforsianrs are selecred on I3-agar (Sambrooic ec 
al.) concalnlng 100 ug/nl axnpicillin (see below). 

?CR Reactions for Screening clones in coli : 

3^ coli colonies which contain the correct ONA insert 
are identified using ?CR ( see generally, Sandhu et al., 
BioTechnioues 7:689-590 (1989)). Using a toothpick, colonies 
are scraped from an overnight plate and added to a 20 ul to 45 
ul PGR reaction mix containing about SO pmoles of each 
hybridising primer (see example using primers MK23A28 and 
MK25A28 to select orientation of SacII fragment in pHY32#6) , 
200 ym to 400 mM of each dNTP, and iX reaction buffer (PerJcin 
Slmer Cetus, NorwaDc, C?) . After boiling the 3^ coliXPCR mix 
in a boiling water bath for 10 minutes, 5 ul of Tag polymerase 
(0.5 units) (Perxin Slmer Catus, Norwalk, Conn.) in IX reaction 
buffer is added. The PGR reaction parameters are generally set 
with a denaturing step of 94 for 30 seconds, annealing at 
55 •C for 45 seconds, and extension at 72 •C for 45 seconds for 
30 to 36 cycles. PCR reaction products are ran on agarose or 
Nusieve agarose (FMC) gels to detect the correct fragment sise 
amplified. 

Ligations : 

Restriction enzyme digested fragments are either 
purified in 1% LGT (low gelling temperature agarose, FMC) , 2% 
Nusieve (FMC), or 0.75% agarose using techniques standard in 
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the arr. DNA. bands are visualized with erhidium bromide and 
bands ar recovered from gels by excision wich a razor blade, 
fragments isolated from LGT are ligared direcrly in tihe 
Ten microliters of each recovered DNA fragment is used to 
assemble the ligation reactions, producing final ligation 
reaction volumes of about 23 ul. After excision with a razor 
blade, the recovered gel bands containing the desired DNA 
fragments are melted and brought to IX ligase buffer and 3 
units of T4 DNA ligase (Promega) are added as described above. 
Fragments isolated from either regular agarose or Nusieve 
agarose are purified from the agarose using ultrafree*MC 0.45 
uM filter units (Millipore) and the fragments are ligated as 
described above. Ligation reactions are incubated at room 
temperature for two hours before transforming into frozen 
competent coll cells using standard procedures (Sambrook -et 
al . ) . • 

Transformations : 

Frozen competent coll cells of the strain DHSalpha 
or HBlOl are prepared and transformed using standard procedures 
(Sambrook et al.). E. Coli "SORE* comnetent cells are obtained 
from Stratagene (La Jolla, CA) . For ligations carried out in 
LGT agarose, after ligation reactions are complete, 50 mM CaC12 
is added to a final volume of about 150 ul and the solution 
heated at approximately 55 •C for about 10 minutes to completely 
melt the agarose. The solution is then mixed and chilled on 
ice for about 10 minutes before the addition of about 200 ul of 
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competenr cells which had b en thawed on ice. This mixture is 
allowed to incubate for 30 minutes on ice. The mixture is next 
heat shociced at 42 *C for 60 seconds before chilling on ice for 
two minutes. Next, 800 ul of SOC media (20% tryptoner 0,5% 
yeast extract, 10 mM NaCl, 2.5 mM KCl, adjxisted to ?H 8 with 5 
N NaOH, 20 mM MgC12:MgS04 mix, and 20 mM glucose; SambrooJc et 
al.) is added and the cells are incubated at 37 •€ with shaking 
for about one hour before plating on selective media plates. 
Plates typically are L-agar (Sambrook et al.) containing 100 
pg/ml ampicillin. 

When ligations are carried out in a solution without 
agarose, typically 200 ul of frozen comet ent coli cells 
(strain DHSalpha (BRL, Gaithersburg, MD or Sure cells r 
Stratagene, La Jolla, CA) are thawed on ice and 5 ul of the 
ligation mixture added. The reaction is incubated on ice for 
about 45 to SO minutes, the cells are then heat shocked at 42* 
for about 90 seconds. After recovery at room temperature for 
about 10 minutes, 300 ul of SOC medium is added and the cells 
are then incubated 1 hour at 37 *C with shaking and plated as 
above. 

When screening for inserts into the beta-galactosidase 
gene in some of the standard vectors used, 200 ui of the 
recovered transformation mixture is plated on LB-agar plates 
containing 0.008% X-gai, 80 uM IPTG, and 100 ug/nil ampicillin 
(Sambrook et al.). The plates are incubated at 37* overnight 
to cLllow selection and growth of trans f ormants . 
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Miniscreening DNA: 

Transf oraancs from tiie selecrive media plates are grown 
and tneir plasmid srrucrure is examined and confirmed using 
standard plasmid mini-screen procedures (Sambrook ec al.) . 
Typically, the "boiling" procedtzre is used to produce small 
amoxints of plasmid DMA for analysis (Sambrook et al«) • 
Alternatively/ an ammonium acetate procedure is used in some 
cases. This procedure is a modification of that reported by 
Shing-yi Lee et ai*, Biotechnioues 9:S76-S79 (1990). 

1) Inoculate a single bacterial colony from the 
overnight selection plates into 5 ml (can be scaled down to 1 
ml) of TB (Sambrook et al.) medium and grow in the presence of 
the appropriate antibiotic. 

2) Incubate on a roller at 37 •C overnight. 

3} Collect 5 ml of bacterial cells in a plastic 
Oakridge tube and spin for 5 min. at 5000 rpm in a Sorvall 
SS-34 rotor at 4»C. 

4) Remove the supernatant. 

5) Resuspend the pellet in 1 ml of lysis bufSer (50 mM 
glucose, 25 mM Tris-HClIpH 3.0], 10 mM SDTA and 5 mg/ml 
lysozyme) , vortex for 5 seconds^ and incubate at room 
temperature for 5 min. 

€) Add 2 ml of freshly prepared alkaline solution (0.2 
N NaOH, 1% sodium dodecyl sulfate) , tightly secure lidr mix by 
inverting 5 times and place tube in an ice-water bath for 5 
min. 

7) Add 1.5 ml of ice-cold 7.5 M ammonium acetate (pH 
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7.S) CO ::±ie solution, mix by inverting the tube gently 5 times 
and place on an ice-water batii for 5 lain. 

15) Centrifuge mixture at 9000 rpm for 10 min. at room 
temperatiire . 

9) Transfer clear supernatant to a IS ml Corex tube and 
add 0,S volumes of isopropanol (approx, 2.5 ml). Let sit at 
room temperature for 10 min. 

10) Centrifuge tiie mixture at 9000 rpm for 10 min. at 
room temperature and discard the supernatant. 

11) Resuspend the pellet in 300 ul of TE buffer. Add S 
ul of a stock of RMase A & ?1 (made as a 200 ul solution by 
adding 180 ul of RMase A [3254 Units/mg protein, 5.S mg 
protein/ml] and 20 ul of RNase Tl[481 Units/ug protein, 1.2 mg 
protein/ml] ) . These stocJcs may be pur c hased from USB (US 

« 

Biochemical) . Transfer to a microcentrifuge tube and incubate 
at 37»C for 15 min. 

12) Add 75 ul of distilled water and 100 ul of 7.5 M 
ammonium acetate and incubate in an ice-water bath for 10 min. 

13) Centrifuge the mixture at 14,000 rpm for 10 min. in 
a Becionan microfuge at 4*C. 

14) Precipitate by adding 2.3 volimes of 100% EtOH 
(approx. 1 ml) and incubate in an ice-*water bath for 10 min. 

15) Spin at 14,000 rpm for 10 min. in a microfuge. 

16) Wash pellet with 70% ethanol (using 0.5 ml-1 ml) . 
Dry the pellet and resuspend in 100 ul of IX New Sngland 
Biolabs restriction enzyme Buffer 4 [20 mM Tris-HCl (pH 7.9), 10 
mM magnesium acetate, 50 mM potassium acetate, 1 mM DTT] . 
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Measure ccncenrrarion and checJc purity by speccropiiocometry at 
absorbances 250 and 280 nm. 

For a more rapid determination as to whether or not a 
particular bacterial colony aarbored a recombinant piasmid, a 
PCa miniscreen procedure is carried out using a modification of 
tiie method described by (Sandhu, G.S. et al., 1389, 
BioTechniques, 7:Sa9-€90). Briefly, the following mi«nire is 



100 ul primer mix above, 20 um each primer, 
100 ui dMTP mix (2.5 mM each) 

100 ul lOX AmpliTaq buffer (Perkin-Slmer Cetus, IX 
buffer = 10 nw Tris-HCl pH 8.2, 50 mM KCl, 1.5 mM MgC12, 
0.01% gelatin) 

700 ul deionized water. 



20 ul of the above mixture is put into a a 0.5 ml 
polyproplyene PCX tube. A transformed bacterial colony is 
Piciced with a toothpick and resuspended in the mixture. The 

is put in a boiling water ijath for 10 minutes and then 
cooled to room temperature before adding 5 ul of the mix 
described below: 

265 ul deionized water 

30 ul lOX Amplitaq buffer (PerJcin-Slmer Cetus, IX 
buffer = 10 mM Tris-HCl pH 8.2, SO mM XCl, 1.5 mM MgC12, and 
0.01% gelatin) 

7.5 ul Taq polymerase 
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The samples ar overlaid witii 50 ul of aiineraJ. oil and 
BCSL is carried put for 30 cycles using th following 

parameters : 

denature: 94* for 1 min 
anneal: 55* for 1 min 
extend: 72* for 45 seconds. 

After PCa amplification, 1 ul of loading dye (30% 
glycerol, 0.25% Bromophenol blue, 0.25% xylene cyanol) is added 
to the whole reaction and 20 ul of the mixture is loaded on a 
2% Nusiever 1% agarose gel to see if there is a ?CR product of 
the expected size. 

This procedure is used as an initial screen. Hinipreps 
are subsequently carried out to confirm the structure of the 
plasmid and its insert prior to sequencing. 

SXaMPLS 2: AMPLIFICATION AND ASSEMBLY OF SACg QPaRTBR 

Cloning fragments of the synthetic Bt crylA(b) gene: 

The synthetic gene was designed to be cloned in four 
pieces f each roughly one quarter of the gene. The oligomers 
for each quarter were pooled to either be assembled by PGR, 
hybridization, or a combination of hybridization followed by 
PGR amplification as described elsewhere . Synthetic quarters 
were pieced together with overlapping restriction sites Aat 11^ 
Ncol, and Apa I between the 1st and 2nd, 2nd and 3rd, and 3rd 
and 4th quarters respectively* 

Sach quarter of the gene (representing about 500 bp) 
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was assembled by hybridizing the appropriate oligomers and 
amplifying the desired fragmcnr using ?CR primers specific for 
the ends of that cuarter. Two different sets of PGR reactions 
employing two sets of slightly different primers were used. 
The ?CR products of the two reactions were designed to be 
identical except that in the first reaction there was an 
additional AATT sequence at the 5' end of the coding region and 
in the second reaction there was an AGC7 sequence at the 3' end 
of a given quarter. When the products of the two reactions for 
a particular quarter were mi:ced (after removing the polymerase, 
primers and incomplete products) , denatured, and subsequently 
re-annealedr a certain ratio (theoretically 50%) of the 
annealed product should have non-homologous overhanging ends. 
These ends were designed to correspond to the "sticky ends* 
formed during restriction digestion with ScoRI at the 5' end 
and Hind III at the 3' and of the molecule. The resulting 
molecules were phosphorylated, ligated into an ScoRI/Hindlll 
digested and phosphatased Bluescript vector, and transformed 
into frozen competent E-^ coli strain DHSalpha. After 
selection, the B^^^ coli colonies containing the desired fragment 
are ideaitified by restriction digest patterns of the ONA. 
Inserts representing parts of the synthetic gene are 
subsequently purified and sequenced using standard procedures. 
In all cases, clones from multiple PCS. reactions are generated 
and sequenced. The quarters are then joined together using the 
unique restriction sites at the junctions to obtain the 
complete gene. 
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Cloned quarrers are identified by mini-screen 
procedures and zhs gene fragaenr sequenced* It is found that 
errors are frequently introduced into the sequence^ most 
probably during the PGR amplification steps • To correct such 
errors in clones that contain only a few such, errors , 
hybridized oligomers are used. Hybridized fragments are 
digested at restriction enzyme recognition sites witiiin tbe 
fragment and cloned to replace the imitated region in the 
syntbetic gene. Hybridized fragments range from 90 bp in 
lengtb (e.g. tbe region tbat replaces tiie fragment between tbe 
Sac II sites in tbe 2nd quarter) to tbe about. 350 bp 4tb 
quarter fragment tbat replaces two PGR induced mutations in tbe 
4tb quarter of tbe gene. 

Due to tbe higb error rate of PClr a plasmid is 
designed and constructed wbicb allows the selection of a cloned 
gene fragment tbat contains an open reading frame. This 
plasmid is designed in sucb a manner tbat if an open reading 
frame is introduced into tbe cloning sites, tbe transformed 
bacteria could grow in tbe presence of Jcanamycin. Tbe 
construction of tbis vector is described in detail below. Tbis 
selection system greatly expedites tbe progress by allowing one 
to rapidly identify clones with open reading frames witbout 
having to sequence a large number of independent clones. Tbe 
s3mtbetic quarrers are assembled in various plasmids, including 
3SSK (Stratagene; La Jolla, Ca) , pDClS (SambrooJc et al.), and 
tbe Xm-expression vector. Other suitable plasmids, including 

m 

pUC based plasmids, are 3cnown in the art and may also be used. 
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Compiere sequencing of cloned fragments ^ western blor analysis 
of cloned gene products r and insect bioassays using Europecin 
com borer as the test insect verify that fully functional 
synthetic Bt crylA(b) genes have been obtained - 

Construction of the Km-*expression vector to select open reading 
frames : 

The Km-expression vector is designed to select for fragments of 
the synthetic gene which contain open-dreading frames. PCa 
oligomers are designed which allow the fusion of the NPTIl gene 
from TnS starting at nucleotide 13 (Reiss et al., 5MB0 J, 
3:3317-3322 (1984)) with pOClS and introduce useful restriction 
sites between the DNA segments. The poly linker region contains 
restriction sites to allow cloning various synthetic Bt IP 
fragments in-frame with the Km gene. The 88 bp 5' oligomer 
containing the polyiinker region is purified on a S% 
polyacrylamide gel as described above for the oligomer PAGE 
purification. A PCa reaction is assembled with a 1 Kb Bgl 
IlXSma I template fragment which contains the NPT II gene 
derived from TnS. The PGR reaction mix contains 100 ng of 
template with 100 pmols of oligomers KE72A28 and XE74A28 (see 
sequences below), 200 nM dNTP, and 2.5 Units of Tag polymerase 
all in a 50 ul voliane with an equal volume of mineral oil 
overlaid. Sequences of the primers are: 

KE74A28 

5 ' -GCAGATCTGG ATCCATGCAC GCCGTGAAGG GCCCTTCTAG AAGGCCTATC 
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GATAAAGAGC TCCCCGGGGA TGGATTGCAC GCAGGXTC-S' 
KE72A28 

5'-GC3TTAACAT GTCGACTCAG AAGAACTCGT CA&GAAGGCG-3' 

The ?CR parameters used are: 94*C for 45 seconds (sec), 
55 *C for 45 sec, and 72*C for 55 sec with the extension ar step 
3 for 3 sec for 20 cycles. All ?CR reactions are carried out 
in a Perjcin-Slmer Cetus tiiermocycler. The amplified ?CR 
product is 800 bp and contains the polylinker region with a 
translational start site followed by unique restriction sites 
fused in^frame with the Km gene from base #12 running through 
the translational terminator . paC:KM74 is the Km^expression 
cassette that was assembled from the 800 bp Bgl II\Sal I 
polylinker /Km fragment cloned in the PUC18 vector. The lacZ 
promoter allows the Xm gene to be expressed in coli . 
pDC:KM74 derivatives has to first be plated on L3-agar plates 
containing 100 ug/ml ampicillin to select transformanrs which 
can subsequently be screened on LB-agar plates containing 25 
mg/ml lcanamycin/I?TG. Synthetic Bt IP gene fragments are 
assembled from each quarter in the Km-cassette to verify 
cloning of open«reading- frame containing fragments pieces. The 
first HC3 active synthetic 3t IP gene fragment, pBt:Km#6, is a 
3t I? gene that shows Km resistance. This fragment is 
subsequen t ly discovered to contain mutations in the 3rd and 4th 
quarter which are later repaired. 
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EXAMPLE 2A: SYNTHESIS AND CLONING OF THE FIRST QOARTER OF THE 

SYNTHETIC GENE tbase oairs 1 no 5501 

The following procecrures are followed in order to clone 

the first quarter of the synthetic DNA sequence encoding' a 

synthetic Bt crylA(b) gene. The same procedures are 

essentially followed. for synthesis and cloning of the other 

(quarters r except as noted for primers and restriction sites • 

Template for Quarter 1: Mixture of equal amounts of purified 
oligomers ni-QT and LI to L7 

PGR Primers: 
Forward: 

PI (a) : 5' -GTCGSkCAAGG ATCCAACAAT GG-3' 

PI (b) : S'-AATTGTCGAC AAGGATCCAA CA&TGG-3' 

Reverse : 

P2 (a) : 5' -ACACGCTGAC GTCGCGCAGC AC3--3' 
?2 (b) : 5'-AGCTACaCGC TGACGTCGCG CAG-3' 

Primer pair Al: PI (b) + ?2(a) 
Primer pair A2: PI (a) P2 (b) 

The PGR reaction containing the oligomers comprising 
the first quarter of the synthetic maize-optimized 3t IP gene 
is set up as follows: 
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200 ng oligo mix (ail oligos for the quarrer mixed i 
equal amounts based on weigiir) 

10 ul of primer mix (1:1 mix of each ac 20 uM; 
axe described above) 
5 ul of lOX PGR buffer 
PGR buffer used may be either 

(a) IX concentration =- 10 mM KCl. 10 mM (NH4)2S04, 20 
mM Tris-HClr pH 8.0, 2 mM MgS04, and 0.1% Triton X-100) , or 

(b) IX concentration = lOmM Tris-HCl pH 8.3, 50 mM KCl 
l,5mM MgCl2f 0.01% wt/vol gelatin. 

Components are mixed, heated in a boiling water bath 
for 5 minutes, and incubated at 65»C for 10 minutes. 

Nextr the following reagents are added: 

3 ul of dNTPs mixture (final concentration in the 
reaction * 0.2 mM each) 

5 units polymerase. 



The final reaction volxane is SO microliters. 

OligomMS are then incubated for 3 min at 72 •C and then 
a PGR cycle is ran. The Pd reaction is run in a PerJcin Elmer 
thermocycler on a step cycle protocol as follows: 



denaturation cycle 
annealing cycle 
extension cycle 
cycle) 

number of cycles: IS 



94 • for 1 minute 
60* for 1 minute 
72* for 45 seconds 3 sec per 
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* 

After zh reaction is complete, 10 ul of the ?CR 
reaction is loaded on a 2% Nusieve-GTG (FMC) , 1% agarose 
analytical gel to monitor the reaction- The remaining 40 ul is 
used to clone the gene fragments as described below « 

PCS Products 

The termini of the double stranded PGR product 
corresponding to the various primer pairs are shown (only upper 
strand) : 

Al AATTGTCGAC ^GCGTGT (554 bp) first qtr. 

A2 GTCGaC GCSTGTAGCT (554 bp) first qtr. 

Hybridization 

40 ]il of each of the PC?, reactions described above is 
purified using a chromaspin 400 column (Clonetechr Palo Alto^ 
CA) according to manufacturers directions. Five ug of carrier 
ONA was added to the reactions before loading on the column. 
(This is done for most of the cloning. However , in some 
reactions the ?CR reactions are phenol: chloroform extracted 
using standard procedures (Sambroolc et al*} to remove the Taq 
polymerase and the generated DNA is recovered from the 
aqueous phase using a standard ethanol precipitation 
procedure.) The carrier DNA does not elute with the PGR 
generated fragments. The Al and A2 reaction counterparts for 
each quarter are mixed^ heated in a boiling water bath for 10 
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minutBs and then incubated at 65 *C ovemigat. The reactions 
are then removed from th 65* bath and ethanol precipitated 
with 1 ul (20 ug) of nuclease fre glycogen (Tracy, Prep> 
aiochem. 11:251-268 (1981) as carrier. The pellet is 
resuspended in 40 ul of deionized vater. 

Phosphorylation reaction 

The phosphorylation reaction is carried out as follows: 
40 ul DMA 
2.5 ul 20 aM ATP 

0.5 ul IDX 3SA/DTT (IX » 5 mM DTT, 0.5 ag/ml 3SA) 
1-0 Ul lOX polynucleotide kinase buffer (IX » 70 

mM Tris.flClr 

?H 7,6, 0.1 M KCl, 10 mM MgC12> 

2.0 ul polynucleotide kinase (Mew England BiolabSr 

20 units) . 

Incubation is for 2 hours at 37*C. 

The reaction is then extracted one time with a 1:1 
phenol : chloroform mixture, then once with chloroform and the 
aqueous phase ethanol precipitated using standard procedures. 
The pellet is resuspended in 10 ul of TS, 

Restriction Digests 

20 ug of aiuescript vector (BSSK+, Stratagene, La 
Jolla, CA) 

10 ul 10 X restriction buffer (IX « 20 mM Tris-flCl pH 
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a.O, 10 mM MgC12, 100 mM NaCl) 

5 ui Eco RI (New Sngiand Biolabs) 100 unirs 

5 ul Hind III (N w Sngiand Biolabs) 100 units 

Final reacrion volume is 100 ul- 

Incuisatiion is for 3 hours at: 37 
When completied^ the reaction is extracted with an equal volume 
of phenol saturated with TE (10 mM Tris.HCl ?H 8.0 and 10 mM 
EDTA) . After centrifugation, the aqueous phase was extracted 
with an equal volume of 1:1 mixture of (TE saturated) 
Phenol: chloroform (the "chloroform" is mixed in a ratio of 24:1 
chloroform: isoamyl alcohol), and finally the aqueous phase from 
this extraction is extracted with an equal volume of 
chloroform. The final aqueous phase is ethanol precipitated 
(by adding 10 ul of 3 M sodium acetate and 250 ul of absolute 
ethanol r left at 4* for 10 min and centrifuged in a microfuge 
at maximum speed for 10 minutes. The pellet is rinsed in 70% 
ethanol and dried at room temperature for 5-10 minutes and 
resuspended in 100 ul of 10 mM Tris.HCl (pH 8,3). 

Vector DHA is routinely treated with phosphatase to 
reduce the number of colonies obtained without an insert. Calf 
intestinal alkaline phosphatase is typically used (Sambroolc et 
al . ) , hut other phosphatase enzymes can also be used for this 
srep. 

Typical phosphatase reactions are set up as below: 
90 ul of digested DNA described above 
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10 ul of lOX Calf intestinai aiJcaline phospixatase 
buffer (1X=50 mM Tris-HCl (pH 3.3), 10 3iM MgC12, 1 mM ZnC12, 10 
mM spemidine) 

1 ul (1 unit) of calf incesrinal alkaline paosphacase 
ICIS, Boehringer Mannheim,. Indianapolis r IN) 

Incubation is at 37 •C for 1 hour. 

The DNA is then gel purified (on a 1% low gelling 
temperature (LGT) agarose gel) and the pellet resuspended in 30 
ul T£. After electrophoresis r the appropriate band is excised 
from the gel using a razor blade, melted at 65* for 5 minutes 
and diluted 1:1 with TE. This solution is extracted twice with 
phenol/ once with the above phenol: chloroform mixture, and once 
with chloroform. The final aqueous phase is ethanol 
precipitated and resuspended in TE buffer. 

Ligation: 

To ligate fragments of the synthetic gene into vectors, 
the following conditions are typically used. 

5 ul of phosphorylatad insert DNA 

2 ul of phosphatased Sco RI/Hind III digested 
Bluescript vector heated at 65* for 5 minutes, then cooled 

1 ul lOX iigase buffer (IX buffer=30 mM Tris.HCl (pH 
7.3), 10 mM MgC12, 10 mM DTT, 1 mM ATP) 

1 ul 3SA (1 mg/ml) 

1 ul ligase (3 units, Promega, Madison, Wise.) 
Ligase reactions are typically incubated at 16* 
overnight or at room temperature for two hoxirs . 

58 



wo 93/07278 PCr/US92/a8476 

Transfamarion ; 

Trans formacion of ligared DNA fragments into E^^^ coll is 
performed using standard procedures (Sambrook et al.) as 
described above. 
Identification of recombinants 

White or light blue colonies resulting from overnight 
inctibation of transformation plates are selected. Plas m ids in 
the transf ormants are characterized using standard mini-screen 
procedures (Sambrooic et al.) or as described above. One of the 
three procedures listed below are typically employed: 

(1) boiling DNA miniprep method 

(2) PC^ minis creen 

(3) Ammoniimi acetate miniprep. 

The restriction digest of recombinant plasmids believed 
to contain the first quarter is set up as follows: 

(a) 3am HI/Aat II digest: 10 ul DNA + 10 ul IX New 
Sngland 3iolabs restriction enzyme Buffer 4 

0.5 ul Bam HI (10 units) 

0.5 ul Aat II (5 units) 

Incubation is for about 2 hours at 37 

Clones identified as having the desired restriction 
pattern are next digested with Pvu II and with Bgl II in 
separate reactions. Only clones with the desired restriction 
patterns with all three enzyme digestions are carried further 
for sequencing. 

Sequencing of cloned gene fragments: 
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Sequencing is perrormed using a modification of 
Sanger's dideoxy chain terminarion metiiod (Sambrooic et ai,) 
using double stranded DUA with th Sequenas 2 Icit (United 
States Biochemical Corpw Cleveland, OH). In all, six first 
quarter clones are sequenced. Of the clones sequenced^ only 
two clones designated pQAl and pQAS are foxind to contain only 
one deletion each. These deletions are of one base pair each 
located at position 452 in pQAl and position 297 in pQ&5- 

Plasmid pOSl is used with pPl-8 (as described below) t 
obtain a first quarter with the expected sequence. 



EXAMPLE 2B: SYNTHESIS AND CLONING OF THE SECOND 

QUARTER rbase oairs 531 to 10501 

Template: oligomers 08-014 and L8-L14 

?CR Primers: 

forward: 

P3 (a): 5'-GCTGC3C3AC GTCAGCSTGT TCSG-S' 
P3 (b) : 5'-AATTGCTGCG CGACGTCAGC GTG-3' 



P4 (a): 5'-GGCGTTGCCC ATGGTGCCGT ACAGG-3' 
?4 (b) : S'-AGCTGGCGT TGCCCATGGT GCCG-3' 



Primer pair 31: P3{b) ^ ?4(a) 
Primer pair 32: ?3{a) ^ ?4 (b) 



PCR Products 
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Bl AAT7GCTGC3 AACZCZ (524 bp) second 

ouarrer 

32 GCTGCG ^AACGCCAGCT (524 bp) 

Hybridizacion^ PCB. aznclificariorir spin column size 
fracrionarion, and cloning of zti±s gene fragment in Bluescripr 
digested with Sco RI/Hind III are performed as described above 
for the first quarter (Example 2A) . The Pd product for this 
quarter is about 529 bp in size representing the second quarter 
of the gene (nucleotides 531 to 1050) • Transformation is into 
frozen competent coli cells (DHSalpha) using standard 
procedures described above (SambrooJc et al.) 

Miniscreen of pQB clones: 

Miniprep DNA is prepared as described above and 
digested with (a) Aat II/Nco I, (b) Pvu II and (c) with Bgl I 
to confirm the structure insert in the vector before 
sequencing. 

Sequencing is performed as described above using the 
dideoxy method of Sanger (Sambrook et al.) . 

A total of thirteen clones for this quarter are 
sequenced. The second quarter consistently contains one or 
more deletions between position 884 and 387. In most cases the 
G ac position 884 is deleted. 

Plasmid pQB5 had only one deletion at position 884. 
This region lies between two Sac II sites (positions 859 and 
949) . Correction of this deletion is described in Example 3. 
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Clones of zHe firsr half (1-1050 bp) . 

A fragment for cloning tiie first half (quarters 1 and 
2) of the, synthetic at maize gene as a single DNA fragment is 
obtained by restriction digestion of the product of a PGR 
reaction coa^rising the first quarter and the second quarter. 
Restriction endonuciease Aat II is used to cut the DNA 
(following phenol extraction and ethanol precipitation), in a 20 
Ul reaction. 15 ul of each of the Aat II digested quarters is 
mixed and ligatad (in a 30 ul volume by adding 5 ul of lOX 
ligase buffer, (1X=30 mM Tris-HCl ?H 7.8, 10 mM MgC12, 10 mtt 
DTT, 1 mM ATE) 14 ui of deionized water and 1 ]xl of T4 DNA 
ligase, 3 units, Promega, Madison, WI) at room temperature for 
2 hr. The result is an about 1 kb fragment as judged by 
electrophoresis on a 1% agarose gel run using standard 
conditions {Sambroolc et al.) Ten ul of the ligation product is 
amplified by ?CR using conditions described previously except 
that only 5 cycles were run. 

Primer Pair: HA« PI (a) + ?4 (b) 
Primer Pair: HB- Pl(b) + ?4 (a) 

The product of these reactions is cloned into 
31uescript (Stratagene, La Jolla, CA) as described for the 
individual quarters. This procedure is only done once i.e., 
all insert DNA is obtained in a particular region from a single 
PGR reaction. 

Thirty-six colonies are miniscreened with Sal I digests 
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and Pvu II digests. All except 4 conrain an inserr of 
approximarely 1 }cb in size of which ac leasr 20 contain the 
correct ?vu II digestion pattern. Sight of these clones are 
selected for sequence analysis. One of the clones, ?1*8, has 
the desired sequence between the Sco NI site (396 bp) and the 
Dra III site (640 bp) . This clone is used to obtain a plasmid 
with the desired sequence up to the Dra III site (640 bp) in 
the second quarter with pQAl (first quarter with a deletion at 
position 452 bp described previously.) 

SXAHPLE 2C: CLONING AND SYNTHESIS OF THIBD 

QDARTER Fbase oairs 1021 to 15001 

Template: Oligos UlS-UZO and L1S-L21 

PCa primers: 

forward 

?5 (a): 5'-TTCCCCCTGT ACGGCACCAT GGGCAACGCC GC-3' 
?5 (b) : 5'-AATTGTACGG CACCATGGGC AAC-3' 



reverse 

P6 (a): S'-GAAGCCGGGG CCZTTCACCA CGCTGG-3' 
P6 (b) : 5'-AGCTGAAGCC GGGGCCCTTC ACC-3' 

Primer pair CI: P5(b) + P6(a) 
Primer pair C2: ?5(a) + P6(b) 

PCS. Producr: 

CI A&TTGTACGG^ ^GGCTTC (475 bp) 3d qtr- 
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C2 TTCCCCTGTACGG ^GGCTTCAGCT (484 bp) 3ci qcr 



?ca reacrions, spin column recovery of the correcr 
sized DNA fragmenc, and ligation inro vectors are performed as 
described above (Example 2&) using a Bluescript vector cut witb 
SCO RI and Hind III. The approximately 479 base pair 9CR 
product represents tbe tiiird quarter of the synthetic gene (NT 
1021 - 1500) • 

Transformation into frozen competent coli strain 
DHSalpba cells, selection and identification of transf ormants, 
characterisation of transf ormants by mini-screen procedures^ 
and sequencing of the synthetic gene fragment in the vector are 
all as described above. 



Mini screen of pQC clones: 

The third quarter is miniscreened using standard 
procedures (Sambrook et al.) . Miniprep DNA is cut with (a) Nco 
I/Apa I and (b) with ?vu II. Clones containing the correct 
restriction digest patterns are sequenced using st andar d 
procedures- A total of 22 clones of the third quarter are 
sequenced. Three major deletion "hotspcts" in the third 
quarter are identified (a) at position 1083 (b) between 
position 1290-1297 and (c) between positions 1356- 13 62. In all 
clones except one, pQCSf there is also consistently an 
insertion of a C at position 1365. In addition to these 
mutations r the third quarter clones contain a large number of 
other apparently random deletions. The common factor to the 

64 



wo 93/07278 



PCr/US92/08476 



rixree murational "iiotspors" in tiie ziiirci quarrer and tiie one in 
tile second quarrer is tiiat cliese regions are ail flanked on 
either side by sequences tiiat are about 30% C+G. Otber regions 
containing 5 to 9 C-Gs in a row are not affected. The 
oligomers in ai5, ai6, UlS, U19, LIS, LIS, LIB and L19 are 
redesigned to reduce tbe C^G content in these regions. Five 
clones each from PCR reaction using the modified oligomers are 
seouenced. 

Plasmid pQCN103 has the correct sequence for the third 

quarter except for a change at position 1226. This change, 

which siibstitutes a G for a C, results in the substitution of 

one amino acid (leucine) for the original (phenylalanine) . 

EXAMPLE 2D: SYNTHESIS ittlD (2»0NING OP FOURTH 

QOARTSR rbase oairs 1480 to 19601 

The fourth quarter of the gene is obtained from a clone 

which is originally designed to comprise the third and fourth 

quarters of the gene. The "second half* of nhe synthetic gene 

is obtained from PGR reactions to fuse the third and fourth 

quarters. These reactions are run with Pd primers PS (a) and 

PS (a) described above for the third quarter and primers P7 (a) 

and pa (a) (described below) . The reverse primer* is modified to 

include a Sac I site and a termination codon. Separate 

reacrions for each quarter are run for 30 cycles using the 

conditions described above. The two quarters are joined 

together by overlapping ?CR and subsequently digested with 

restriction enzymes Nco I and Sac I. The resulting 953 bp 

fragment is cloned dirsctionally into ?CI33054, which has been 
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cut with Nco I/Sac I and r reared with allcaline phosphatase. 

pCIB3054 is constructed by inserting intron #9 of 
PBPcarboxylase (PEPC ivs #9) in the unique Hpa I site of 
pCIB246 (described in detail in Sxample 4} pCIB246 is cut with 
Hpal and phosphatased with CIF using standard procedures 
described in Sxaople 2A. ?SPC ivs #9 is obtained by PGR using 
pPSP-10 as the template. pPSP*10 is a genomic subclone 
containing the entire maize PSP carboxylase gene encoding the 

photosynthetic enzyme, plus about 2.2 Kb of 5 '-flanking and 
1.8 Kb of 3'-flanicing DNA. The 10Kb DNA is ligated in the 
Hindlll site of pQClB. (Hudspeth et al.. Plant Molecular 
Bioloov / 12: 576-5B9 (1989) . The forward PGR. primer used to 
obtain the ?SPCivs#9 is GISUIiAAAAACCAGCII^TC and the reverse 
primer is C7GCACAAAG76GAGTA67 . The PGR product is a 108 bp 
fragment containing only the PSPcarboxylase intron #9 
sequences. The PGR reaction is extracted with phenol 
and chloroform, ethanol precipitated phosphorylated with 
polynucleotide Icinase and treated with T4 polymerase to fill in 
the 3' nontemslated base addition found in PGR products (Glaric, 

Nucleic Acid Research . 16: 9677-9686 (1988)) using 
standard procedures. The kinased fragment is blunt -end cloned 
into the Hpal site of pCIB246, using standard procedures 
described earlier. 

Amplification and Assembly of the Fourth Quarter 
Template: 021-026 and L22-L28 
PCS> primers 
FORWARD 
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97 la): 5' -TGSTGAAGGG CCCC3GCTTC ACCGG-3' 
REVERSE 

?8 (a); 5' -ATCXTCGATG AGCTCCTACA CCTGATCGAT GTGGTA-3' 
PRIMER PAIR 4: P7 (a) + ?8 (a) 
PRIMER PAIR 2: P5 <A) + P6(a) 

Primer pair for overlapping PGR : P7 (a) + P8 (a) 
PGR Product 

fourtil quarter: GGTGAA ^ATCAGGAGCTCATCGATGAT 

(484 bp) third quarter: TTCCCCCTGTA TTCACCGG 

(484 bp) second lialf : GGTGAA -CATGATGAT (953 bp) 

Four positive clones are identified by plasmid 
miniscreen and are subsequently sequenced using standard 
orocedures. 

Plasmid Bt.P2 #1 conrains approximately zhe correct: 
fourrli quarter sequence except for two mutations. T&ese 
mutations are at position 1S23 (substituting an A for a Gr 
resulting in an amino acid change which substitutes a His for 
an Arg) and at position 1S34 (substituting a T for a 
resulting in an amino acid substitution of a Ser for a Thr) • 

Plasmid Bt.P2 #1 is used in the construction of 
pCI34414 described below. (The mistakes are ultimately 
corrected by hybridizing all the oligos of the fourth quarter, 
digesting with Apa I/Bst S II and replacing that region in 
pCIB4414. Therefore, only sequences from position 1842-1960 
remain of Bt.?2#l in the final construct.) 
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The synriietic maize optimized 3t crylA(b) gene is 
designed zo be cloned in quarters . Using the ?CR teciinique, 
however, resuitis in murations, which in most cases are 
deletions resulting in frameshift mutations, Piasmids 
containing individual quarters are therefore sequenced and the 
correct parts ligated together using standard procedures. 

After obtaining first and second quarter clones with 
almost the desired sequence^ piasmids pEBlQ#4 and pEBlQ#5 are 
constructed to obtain the desired sequence of the synthetic Bt 
gene up to the Dra III site at the base pair position 534 (this 
mutation destroys the Dra III site) . The pEBlQ constructs are 
made by ligating a 3.9 JQ) Bco NI\Bam HI fragment from pPl-8 
with a 400 bp fragment from pQ&l. pEBlQfS has the desired 
sequence up to the Dra III sitef but pEBlQ#4 has a mutation at 
base pair position 378. 

Piasmids ?1H1M4 and plHlMS are constructed to repair 
the Dra III site in pEBlQ#4 and pEBlQ#5. Piasmids plHlM#4 and 
#5 are made by ligating a 3.5 Kb Nco I\Aat II fragment from 
pEBlQ#4 and #5 respectively, with a 500 bp Nco I\Aat II 
fragment from pQB5. Plasmid plHlM5 contains a mutation between 
the Sac IZ sites at position 884 in the second quarter of the 
synthetic 3t gene. Plasmid plHlM4 contains the additional 
mutation as described in its precursor construct p£BlQ#4. 

The Sac II site in the Bluescript vector region of 
P1H1M4 is deleted by cutting plHlM4 with Not I and Sac I and 
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converting these sites zo biunr ends using T4 DNA polymerase 
under standard conditions before ligating this 3.9 Kb fragment 
to make ?1H1M4'*S. Deleting the Sac II site in the vector 
region allows the 90 bp Sac II fragment with the nnitation at 
position 884 in the 2nd quarter of plHlM4'^S to be removed prior 
to replacement with a 90 bp Sac II fragment. Oligomers a\L 12 
and 13 are kinased and hybridized (described above) before 
cutting with Sac II and isolating a 90 bp fragment on a 2% 
Nusieve gel. The Sac II fragment is ligated into the about 3.8 
Kb Sac II cut plHlM4'^S vector which has been phosphatased with 
CIP. The repaired Sac II construct is called pH3ra2#6. The 
orientation of the Sac II fragment in pHYB2#6 is detected by 
?CR screening as described earlier using the following primers: 

MK22A28 = 5' -GGSGCTGCSGATGCTGCCCT-3' 
MK2SA28 = S'-GAGCTGaCCCTGaCCSTGCT-S' 
MK26A28 =» 5' -CaCCTGATGGaCATCCTGaA-S' 

Running the 9CR reactions with 50 pmoles of primers 
MK23A28 and MK25A2B produces an approximate 180 bp fra gment » 
indicating the inserted fragment bounded by the Sac IZ sites in 
pHYB2#6 is in the correct orientation* Using primers MK2SA28 
and MK2SA28 in the PGR screening acts as the negative control 
producing an approximate 180 bp fragment only in constructs 
containing the Sac II boxmded fragment in the wrong 
orientation. pflYB2#6 sequence is determined using standard 
procedures. 
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pHY32#6 has on mucation at position 378 which needed 
to be repaired to obtain a first quarter containing the desired 
sequence . 

Plasmid plHG#6 contains the desired sequence for the 
entire first half of the synthetic Bt gene. plH6#6 is made 
from a 3.4 Kb Aat II\Nco I fragment of plHlMS#2 ligated to a 
500 bp Aat II\Nco I fragment from pfiYB2#6. 

To identify clones or partial clones of the synthetic 
gene which contain open reading frames^ the kanamycin selection 
vector, (described above) is used. The foizrth quarter ox the 
synthetic 3t gene is the first put into the kanamycin cassette. 
pKH74-4 contains the approximately 500 bp Apa I\Cla I fragment 
from plasmid 3tS2 (which had been previously transformed into a 
dam- S. coli strain (PO-lOO) to be able to cut with Cla I) , 
ligated to paC:KM74 cut with Apa I\Cla I. Plasmid pKM74-4 
displays k an a my cin resistance bur is later found to contain two 
substitution mutations at positions 1522 and 1534 (mutations 
are described above in the section on cloning the fourth 
quarter; they are substitutions , not deletions or insertions) - 

The correct first half of the synthetic Bt gene from 
plasmid plHG#6 is inserted into plasmid pKM74-4, The resulting 
plasmid^ called pKml24, is made from the about 3.3 Kb Apa I\Baa 
HI fragment derived from pKM74-4 ligated to 1 Kb Apa I\Bam HI 
fragment from plBG#6. pKml24 shows kanamycin resistance. This 
plasmid contains the first, second, and fourth quarters of "^ ti** 
synthetic gene forming a single open reading frame. 

The third quarter of the synthetic gene is next cloned 
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inro pKM124, The first funcrional clone, in piasmid pBt:Km#6., 
is a fimcrional copy of tiie crancated synriiecic cryIA<b) gene 
in tlie Km-casset^e wiiica displays ]canainycin r sistance but 
which contains deletion mutations between the third and fourth 
quarters. ?lasiziid p£t:Xzzi#6 is made from the approxizoately 5 Kb 
Apa I\Nco I pKM124 vector fragment ligated to the approximately 
500 bp Apa IXNco I fragment from pQCN103 (pQCN103 contains a 
mismatch mutation at position 1326 which is repaired later) * 
Contaminating nuclease activity appears to have deleted the Apa 
I site between the third and fourth quarters in pfit:Km#6. The 
Bt gene encoded by the synthetic gene in piasmid pBT:Km#6 has 
about 50*€0 % of the native proteins' activity against SC3, 
The 2 Xb Sma I\Bam HI fragment from pBt:Km#6 is inserted into a 
35S : expression cassette to make a piasmid called 355:Bt6. 

Two functional synthetic 3t clones r each with 
mutations, are initially obtained: plasmids pBT:KM#6 and 
pCIB4414. pCIB4414^ which is 100% active in insect bioassays. 
against Exiropean com borer compared with the native gene, 
contains substitution mutations in the third and fourth 
quarters at positions 1323, 1523, and 1634. 

pCIB4414 is constructed from two plasmids, MG3.G4#18 
and IHG which is described above. MG3.S4#18 is obtained by 
cloning the Apa I/Rpn I fragment in piasmid Bt.P2#l into 
PQCN103 (using those same restriction sites) . This produces a 
piasmid containing the third and fourth quarters of the gene. 
The first half of the synthetic gene from piasmid IHG is cut 
with Bam HI and Nco I and moved into MG3.G4#18 (containing the 
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t±Lird and fourtii quarrers of tii gene) . The resulting plasmid, 
pCIB4414r conrains a funcrional version of the synzh tic gene. 
While being functional^ the synthetic gene in this plasmid 
contains three errors; position 1226 (G substituted for a C) , 
position 1523 (substitute A for a G) , and at position 1634 
(substitution of a T for a C) - 

The fourth quarter in pCI34414 is replaced with a 354 
bp fourth quarter Apa INBst S II fragment obtained from 
hybridizing, ligaring/ and restriction cleaving fourth quarter 
oligomers as described earlier, and isolating the fragment from 
a 2% Nusieve agarose gel« pCIB4408 is a synthetic 3t gene 
clone obtained by replacing the fourth quarter fragment in 
PCIB4414 with the hybridized fourth quarter fragment- To 
insert the CaMV 35S promoter in front of the synthetic Bt gene, 
PCIB4406 is made from a 4 Kb Sco NI\£^n I fragment from plasmid 
p35SBtS and 1.8 Kb Eco I fragment from pCIB4408, 

PCIB440S is 100% active (as compared with the protein 
from the native gene) against SCB but contains the substitution 
mutation in the third quarter of the synthetic gene at position 
1323 resulting in an amino acid substitution of a leucine for a 
phenylalanine. Plasmid paS123#13 is used to repair this 
mutation . 

The third quarter fragment in plasmid pBS123#13.is made 
from an approximately 479 bp hybridized oligomer generated 
fragment. Third quarter oligomers U15-O20 and L1S-L21 are 
3cinased, hybridized, and ligated as described above. PGR 
reactions are carried out as described above with primers P5 (a) 
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and P6(b) for IS cycles. THe PGR producr is treated with 
proteinase K at a final concentration of about 50 ugXml in an 
approximate 95 ul volume for 30 minutes at 37 followed by 10 
minutes at 65*C (Crowe et al.. Nucleic Acid Research 19:184, 
1991.) Subsequently, tbe product is phenol \ chloroform extracted 
and ethanol precipitated using standard procedures before 
cutting with restriction enzymes Apa I and Nco I. 

The approximate 450 bp Apa I\Nco I PGR fragment is 
ligated to the 3.8 Kb Apa I\Nco I vector fragment from plHG#6 
to make pBS122#12. Plasmid pBS122#13 contains the desired 
sequence for the third quarter of the maize optimized cryIA(b} 
gene from position 1319 at the Nsp I site through the Apa I 
site at position 1493, This 170 bp Nsp I\Apa I fragment from 
pBS123#13 is used in the fully active synthetic crylA(b) gene 
in plasmid pCI34418. 

Western 31ot Analysis: 

Western blot analyses of various transf ormants are 
performed using crude extracts obtained from coli grown on 
selective plates. Using a toothpicJCr cultures are scraped from 
fresh plates containing the transf ormants of interest which 
have been grom overnight at 37»C. The positive control for 
expression of the Bt gene in coli was a construct called 
pCI33069 which contains the native 3t-}c gene fused with the 
plant expressible CaMV 35S promoter. pCIB3069 also contains 
the 35S promoter operably linked to the hygromycin resistance 
gene, 355 promoter, with Adh intron #1 operably linked to the 
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GUS gene, and 35S ?romot:er operably linJced ro a gene coding for 
tilie oroducrion of tiie native 3t cryIA(b> IP* A negaciv 
control of coli whicii does not contain a 3t gene is also 
included in tiie analyses. Cultures are resuspended in 100 ul 
of loading buffer containing 62 mM Tris-HCl pH 5.8, 1% SDS^ 
0.0025% bromophenol blue, 10% glycerol and 7.5% 
mercaptoetbanol . After heating the mixtures at 95 for 10 
minutes, the preparations are sonicated for 1-3 seconds. The 
debris is centrifuged in a microfuge at room temperature for 
about 5 minutes and 10 to 15 ul of each sample is loaded onto 
an acrylamide gel with a 10% running gel below a 5% stacking 
gel (Laemmli, Nature 227; 680-685 (1970) ) . After electrophoresis 
overnight at 10 mAmps, proteins are transferred from the gel to 
an Immofailon membrane (Miilipore) . The transfer is done using 
an electrophoretic Blotting Unit (American BioNuclear, 
Emeryville, CXi in transfer buffer (20 mM Tris, .150 mM glycine, 
and 20% methanol} for 1.5 hours at 450 mAmps. 
Buffers for western blotting included: 
Blocking buffer: 2% 7ween->20 

30 aM 7ris-aci ?H 10.2 
150 niM NaCl 



Hasb buffer: 0.05% Tween-20 

30 mM Tris-HCl ?H 10.2 
150 mM NaCl 



Deveiopiag buffer: 100 mM Tris-HCl pH 3.5 

74 



, wo 93/07278 



PCr/US92/08476 



100 mM NaCl 
10 inM MgC12 

Aftsr transfer is csnplete, zhe membrane is incubared 
for abouc ren minutes in tbe blocking buffer • Three 15 minute 
washes with wash buffer are done before the first antibody 
treatment. The first antibody is an immunoaffinity purified 
rabbit or goat antibody prepared using the CrylA(b) protein as 
the antigen (Ciba-Geigy, RTP, H.C.; Rockland Inc., 
Gilbert sviile, ?A.; and Berkeley Antibody CO., Richmond, CA.) . 
The crylA(b) specific antibody is treated immediately before 
use with coli lysate from Bio-Rad in a 1 ml volume with 5 w 
of antibody, 50 ul of S. coli lysate in the wash buffer 
solution. This mixture is incubated for 1 hour at room 
temperature before diluting it 1 to 30 for a final dilution of 
1:6000 with wash buffer. Incubation of the membrane with the 
first antibody is at room temperature for 1.5 hours. 

Three 10 minute washes are done between the 1st and 2nd 
antibody treatments . The second antibody is either rabbit 
anti-goat or goat anti- rabbit /alkaline phosphatase conjugate 
(Sigma, St. Louis, MO.) . Incubation with the alkaline 
phosphatase conjugate is carried out at room temperature for 
one hour using a 1 to 6000 dilution in wash buffer. Six 10 
minute washes are done between the second antibody 
treatment and developing the western blot. The western blot is 
developed in 100 ml of developing buffer with 440 yil of 
nitroblue tetrazolium in 70% dimethyl formamide (75 mg\ml) , and 
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330 ul of 5-bromo-4-chloro-indoiyi-?hosphai:e in 100% dimeciiyi 
formamide (SO mgXnii) . After developing for 15 zo 30 minutes r 
the membrane is washed in water and air dried. 

SXAMPLS 4: CONSTRUCTION OF TRANSFORMATION ^/ECTORS 

Construction of pCI3710 and derivatives. 

CaMV 3SS Promoter Cassette Plasmids ?Cia709 and ?CI3710 
are constructed as shown in Rothstein et al.. Gene .53 : 153-1 SI 
(1987) . PCI3710 contains CaM7 promoter and transcription 
termination sequences for the 3SS RNA transcript [Covey et al., 
Nucl. Acids. Res.. 9:6725-5747 (1981)]. A 1149 bp Bglll 
restriction fragment of CaMV DNA [bp 5494-7643 in aohn et al.. 
Current Topics in Microbiology and Immunology / 96:194-220 and 
Appendices A to G (1982) ] is isolated from CaMV DNA by 
preparative agarose gel electrophoresis as described earlier 
Tlie fragment is mised with BamHI -cleaved plasmld paC19 DNAr 
treated with T4 DNA llgase, and transformed into £^ coll . 
(Note the 3amHI restriction site in the resulting plasmld is 
destroyed by ligation of the Bglll cohesive ends to the BamHI 
cohesive ends.) 

The resulting plasmldr called paci9/355r is then used 
in oligonucleotide-directed in-vitro mutagenesis to insert the 
BamHI recognition sequence SGATCC immediately following CaMV 
nucleotide 7483 in the Hohn reference. The resulting plasmldr 
pCIB710, contains the CaMV 35S promoter region and 
transcription termination region separated by a BamHI 
restriction site. DKA sequences inserted into this BamHI site 
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will be expressed in plants by tiiese CaMV rranscripcion 
regulation sequences- (Also note tiiat pCI3710 does not contain 
any ATG translation initiation codons between the start of 
transcription and the 3amHI site) . 

PCIB710 is modified to produce pCIB709 by inserting a 
Bam HI fragment containing the coding sequence for hygromycin 
phosphotransferase from pLG90 [Rothstein et al«. Gene, 
53:153-161 (1987)1 in the Bam HI site. 

PCIB709 is modified to produce pCIB996 by removing the 
ATG just upstream from the initiation codon of the hygromycin 
phosphotranserase gene using standard mutagenesis techniques 
while inserting a Bgl II restriction site at this location. 
The resulting plasmid, pCIB996, is further modified to remove 
the 3am HI, Sma I and Bgl II sites in the 5' untranslated 
leader region located 5' of the initiation codon for the 
initiation codon. The result is a change of DNA base sequence 
from -TATAAGGATC CCGGGGGCA AGATCTGAGA TATG-Hyg to -TATAAGGATC 
TGAGATATG^flyg . The resulting plasmid is }cnown as pCIB3073. 

Alternatively, pCIB710 is modified to produce pCIB900, 
by inserting the Bam HI - Bel I fragment of pCIB10/355Bt, which 
contains the 545 amino acid 3t coding sequence, described in 
Part C4 below, into the Bam HI site of pCIB710 to create 
pCIB71Q/35SBt. To introduce an antibiotic resistance maricer, 
pCI3709 is cut with Sal I, a Kpn I /Sal I adaptor is ligated and 
the resulting ligation product is cut with Kpn I. The Kpn 
fragment of pCIB709 containing the 35S /hygromycin resistance 
gene is inserted into the Kpn I site of pCIB710/35SBt to 
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produce ?CI3900. 

Genes useful as tiie selecrable oiarJcer gene include the 
hygromycin resistance gene described in Rotilistein et al.r Gene 
33: 153-161 (1987) . The hygromycin gene described in zhls 
reference is moved into a pDC plasmid such as ?CI3710 or 
?CI3709 and the "extra" ATG upstream from the hygromycin 
phosphotransferase coding sequence is removed to create 
pCIB996. This modified ?CIB996 gene is further modified -o 
remove a Bglll, BamSI and Smal sites from the 5' region of the 
gene using standard techniques of molecular biology to make 
PCI33073. 

pCIB932 is a pOC19-based plasmid contai n i n g the 
chimeric gene ?ep-C:?romoter\Bt\Pep-C: terminator. It is 
comDosed of fragments derived from pPEP-lOr a Hindlll subclone 
of a genomic clone, Hl-lambda-14r ?M&S CT5A , 83:2884-2888 
(1986) , of the maize gene encoding the PEP carboxylase enzyme 
active in photosynthesis, and from pCIB930r which is a BamHI 
fragment containing the S4S amino acid truncated form of the 
the cryiab endotoxin gene in the BamHI site of pOClS. 

The 2.S W3 BcoRI-XhoI fragment from pPEP-10, conta in i n g 
the polyA addition site from the PSP carboxylase gene, is 
isolated and digested with PstI and Hindi. The restriction 
digest is ligated with ?stI/HincII digested paci8, transformed 
into coli and transf ormants screened for those containing a 
412 bp Pstl-HincII insert in pUClS and the insert verified by 
sequencing- The resulting plasmid is called pCI3931. 

The nuclear gene encoding the phosphoenolpyruvate 
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carD xylase isozyme (••Pep-C*; is described in Hudspecii et ai., 
Planr Molecular aioloav / 12: 579-589 (1989) . ?CI3932 is 
constructed by the ligation of tiiree fragxaents. The first 
fragment, containing tlie PEP-C transcription terminator, is 
produced by digesting pCIB931 to completion with Hindlll, 
partially witb SphI and tbe 3098 bp fragment isolated. The 
second fragment, containing the Bt endotoxin coding sequence, 
is produced by digesting pCIB930 with Ncol and SphI and 
isolating the 1950 bp fragment. The third fragment, containing 
the PEP-C promoter, is produced by digesting pPEP-10 to 
completion with flindlll, partially with Ncol and isolating the 
2.3 3cb fragment. The ligation mix is transformed into S. coli , 
transx ormants with the correct insertion identified and the 
insert verified by sequencing. 

pCIB932 is cut with PvuII to generate a 4.9 Kb fragment 
containing the maize Pep-C; promoter \Bt\Pep-C; terminator and 
purified on a 1% LGT agarose gel in IX TAB. The linearized 
PCI33079 vector and the 4.9 Kb insert from pCIB932 are ligated 
using T4 DN& ligase in LGT to make pCIB4401. pCIB4401 is a 
maize transformation vector containing the chimeric genes: 
35S:promoter\PAT\35S: terminator, Pep-C:promoter\Bt\Pep-C: 
terminator, and 35S:promotar\AdhI #1 intron\GUS\35S: 
terminator. 

Construction of pCI3246 (3SS-GnS-35S) 

A CaMV 35S promoter cassette, pCI324 6, is constructed 
as follows. 

The Ddel restriction site at nucleotide position 7482 
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of zhB CaMV genome [Francic ec ai.. Call . 21:285-294 (1980)] is 
modified by insertion of a 48 bp oligonucieocide concaining 
several restriction enzyme sites including an Ncol (CCATGG) 
site, a Sail (GTCGAC) site, and an SstI (GAGCrC) site. This 
altered CaMV^ 35S promoter is inserted into a paci9 vector tiiat 
had been modified to destroy the vector's SstI and Sail sites. 
Thus, the CaMV 3SS promoter of pCIBlSOO contains unique SstI 
and Sail sites for cloning. 

pCIBlSOO is digested with Sstl/Ncol and ligated with 
the GUS gene obtained from pBI221 (Clontech Laboratories r Inc., 
Palo Alto, ca.) . The Hcol site is fused to the CTS gene such 
that the ATG of the Ncol site functions as the start codon for 
the translation of the GUS gene. The CaMV 3SS polyadenylation 
and termination signals are used for the 3' end of the chimeric 
gene. 

Construction of pCI33069 {3SS-Adhl-GUS-3SS) 

PCI3246 is modified by adding the maize alcohol 
dehydrogenase gene Adhl intron number 1 (Adhl) (Dennis et ai.*. 
Nucleic Acids Research , 12:3983-4000 (1984)) into the Sal I 
site of pCIB246 to produce plasmid pCIB3007. The Adhl intron 
is excised from the maize Adhl gene as a Bal I/Pst I fragment 
and subdoned into pOClS that was cut with Sma I/Pst I to make 
a plasmid called Adh 1026. Adh 102S is cut with ?vu Il/Sac Ilr 
the fragments are made blunt ended with T4 DNA polymerase, Sal 
I linicers are added using standard procedtures and a fragment of 
about 560 bp is recovered from a 3 % NuSeive gel and ligated 
into Sal I cut/phosphatase treated pUClS. The Sal I linJcered 

80 



wo 93/07278 PCr/US92/08476 
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Adh inrron #1 in che resulting piasmid is cur out wizh Sal I, 
gel purified, and ligaced into Sal I cur/ nhospiiarase rreared 
pCIB246 to make plasmid ?CIB3007. 

pCIB3007 is cut witii PstI and the ends made blunt by 
using T4 DNA polymerase (NEW Sngland 3iolabs) according to the 
suppliers' specifications. The resulting bliint ended molecules 
are cut with Sph I and the approximately 5.8 Xb fragment with 
one blunt end and one Sph I end is purified on a low gelling 
temperature (LGT) agarose gel using standard procedures. 
pcr3900 is cut with Sma I/Sph I and the fragment containing the 
35S/Bt gene is purified on a LGT agarose gel. The two gel 
purified fragments are ligated in LGT agarose using T4 DMA 
ligase according to standard conditions. The resulting ligated 
fragments are transformed into coli using sta n da r d 
procedures and the resulting plasmid is called ?CIB3062. There 
are two versions of pCIB3062. pCIB3062#l has a Sma I site 
regenerated where the Sma I site and the T4 polymerase blunted 
ends are ligated. This most likely results from the T4 
polymerase nibbling a few base pairs from the Pst I site daring 
the blunting reaction. pCIB3Q62#3 does not have this Smal 
site. 

pCia30S2#3 is cut with Kpnl and made blunt -ended using 
T4 DUA polymerase r and subsequently cut with ?vu II to yield a 
5.4 Kb fragment with blunt ends containing the 35S/GUS and 
3SS/Bt genes. This blunt -end fragment is ligated into Sma I 
cut PCIB3073 to produce pCIB30S3 or pCIB3069. pc:33069 
contains the same fragment used to make pCIB3063, but the 
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cilimeric genes in pCia3Q69 are ali in "iie same relative 
orientation, unlike those in ?CI33063- These plasmids contain . 
a) a 35S promoter operaiaiy linked to zhe ixygromycin resistance 
gene; b) a 3SS promoter^ with Adh intron #1, operafaly linked to 
the GDS gene; and c) a 35S promoter operafaly linked to a gene 
coding for the production of the synthetic crylACb) 
insecticidal protein from Bacillus thurinaiensis , as described 
above « 
GUS Assays: 

GUS assays are done essentially as described in 
Jefferson, Plant Mol. 3io. Reporter , 5:387-405 (1987). As 
shown above, plasmid pCZ3246 contains a CatCV 35S promoter fused 
with the GUS gene. The 5' untranslated leader ox this c hi mer ic 
gene contains a copy of the maise Adhl intron #1. It is used 
here as a transformation control. Although the same amount of 
pCIB246 is added to each transformation, the calculated 
activity varied among at constructs tested. The values 
reported below are averages of 3 replicates. pCZB4407 was 
tested twice. 

PCIB3069 28 nM MO/ug/min 

PCIB4407 0.7 nM MD/ug/min, 2.3 nM MD/ug/min 

EXAMPLE 5A: ASSAY OF SYNTHETIC crylA(b) gene FOR INSECTICIDAL 

ACTIVITY AGAINST EUROPEAN CORN aORER 

The synthetic crylA(b) gene in pCIB4414 in E^ coli . is 
assayed for insecticidal activity against European com borer 
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according no rhe following prorocol. 

Molten artifical insecr diet is poured into a 60 aim 
Gellman snap*-cap perri dish. After solidification, S. coli 
cells r suspended in 0.1% Triton X-100, are- spread over the 
sxirface at a concentration of 3 X 107 cells/c3i2. The plates 
are air dried. Ten ^irst instar European com borer, Ostrinia 
nubilaiis , which are less than 12 hours old are then placed 
onto the diet surface. The test is inciibated at 30 C in 
conrpleta daricness for 2-5 days. At the end of the test percent 
mortality is recorded. A positive clone has been defined as 
one giving 50% or higher mortality when control E. coli cells 
give 0-10% background mortality. 

For comparison, the native crylA(b) gene in pCIB3069 is 
tested at the same concentration. Clones are tested at 3 x 10 

r 

calls /cm" diet; 20 insects per clone • 

The following results are observed: 



Clone Percent Mortality 
Control Q 
pCia3069 100 
PCIB4414 100 



These results indicate that the insecticidal crystal 
orotsin produced by the synthetic crylA(b) gene demonstrates 
activity against European com borer comparable to that of the 
IP produced by the native crylA(b) . Other plasmids containing 
a synthetic crylA(b) gene were assayed in a similar manner. 

EXAMPLE SAB: ASSAY OF CRYIA(b) PROTEIN FOR INSECTICIDAL 
ACTIVITY AGAINST SUGARCANE BORER. 

CrylA(b) was expressed in coli and assayed for 
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ins cricidal acrivity against Sugarcane borer ( Diarrea 
saceharalis ) according zo the same ororocoi used for European 
com borer/ described iimnediacely above. The results are 
summarized in tbe Table. 



SUGARCANE 30RER ASSAY WITH Bt PR0TSI2I FROM £. COLI 



Concentration Mortality 

(ng/g) CrylA(b) 

10 0 

25 0 

50 7 

100 13 

250 40 

500 53 

1000 80 

LC50 380 

95% CI 249-646 



The results indicate that the insecticidal protein 
produced by a maise optimized 3t gene is effective against 
Sugarcane borer. The upper concentrations of CrylA(b) protein 
250 ng/g-1000 ng/g, are achievable in transgenic maize plants 
produced in accordance with the instant invention. 



SXAMPLS 6: MAIZE gROT OPLAST ISOLATION AND TRANSFORMATION WITH 

THE SYNTHETIC 3T GENE 

Expression of the synthetic 3t gene is assayed in 
transiently transformed maize protoplasts. 



Protoplast Isolation Procedure: 

1. The contents of 10 two day old maize 2717 Line 5 
suspension cultures are pipetted into 50 ml sterile tubes and 
allowed to settle. All culture media is then removed and 
discarded. 
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2. Ceils (3-5 lai Packed Ceil Volume) are resuspended 
in 30 ml proroplasr enzyme solurion. Recipe follows: 
3% Cellulase RS 

1% Macerozyme RIO in KMC Buffer 
KMC Buffer (recipe for 1 liter) 



KCl 



MgClj-eHjO 
CaCl2-2H20 
MES 



8.65 g 
16.47 g 

12. SO g 
S.O g 



pH 5.6, filter srerilize 

3. Mix cells well and aliquot into 100x25 noa 
dishes r about 15 ml per plate. Shake on a gyratory shaker for 4 
hours to digest. 

4. Pipette 10 ml KMC through each 100 micron sieve to 
be used. Filter contents of dishes through sieve. Wash sieve 



with an equal volume KMC. 

5. Pipette sieved protoplasts carefully 



50 ml 



tubes and spin in a Beckman TJ-6 centrifuge for 10 minutes at 
1000 rpm (500 x g) . 

6. Remove stxpematant and resuspend pellet carefully in 
10 ml KMC. Combine contents of 3 tubes into one and bring 
volume to 50 ml with KMC. 

7. Spin and wash again by repeating the above step. 

8. Resuspend all washed protoplasts in 50 ml KMC. 
Count in a hemocytometer . Spin protoplasts and resuspend at 3 
X 10*^ /ml in resuspending buffer (RS Buffer) . 

RS Buffer (recipe for 500 ml) 



85 



wo 93/07278 



7US92/08476 



maxuii&ol 



27.33 g 



CaCl- (0.1 M srocJc) 75 ml 



MES 



Q .5 g 



pH 5.8r filter sterilize 
Protoplast Transformation Procedure : 

1. Aliquot 50 U9 plasoid DNA (5t IP constructs, bozh 
synthetic (pCIB44a7) and native (pCIB3069) ) to 15 ml 
polystyrene culture tubes. Also aliquot 25 U9 6US<*containing 
plasmid DNA (which does not contain Bt IP (pCIB246) to all 
tubes. 3 replications are used per construct to be tested, 
with 1 rep containing no DNA as a control. 
3t constructs: GU5 construct; 

PCIB3069 PCIB246 
PCIB4407 



0.4 M mannitol 

0.1 M Ca(N03J2-4H20 

pH 8.0, filter sterilize 

.4. Mix gently to combine protoplasts with PEG. Wait 30 



minutes . 

5. Sequentially add 1 ml, 2 ml, and S ml W5 solution at 
5 minute intervals. 
WS Solution: 



2. Gently m±x protoplasts well and aliquot 0.5 ml per 



tube. 



3. Add 0.5 ml ?EG'-40 to each tube. 



PEG-40: 
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154 mM NaCl 

125 mM CaClj-HjO 

5 mM KCl 

5 mM glucose 
PH 7.0^ filter sterilize 

6. Spin for 10 minutes in a Becicaian ?J-€ centrifuge at 
about 1000 rpm (500g) . Remove supernatant. 

7. Gently resuspend pellet in 1.5 ml FW media and plate 
carefully in 35x10 mm oetri disnes • 

FW media (recipe for 1 liter) : 



MS salts 4,3 g 

200X as vits. 5 ml 

sucrose 30 g 

proline l.S g 

mannitol 54 g 

2,4 D 3 mg 



pH 5.7, filter sterilize 

8. Incubate overnight in the dark at room temperature. 

9. Perform GUS assays, insect bioassays, and SLISA's on 
protoplast extracts as described below. 

EXAMPLE 7: CONSTRUCTION OF A FOLL-LENGTH SYNTHETIC MAIZE 

OPTIMIZED CRYIA(b) GENE 

Sequence 4 shows the synthetic maize optimized sequence 

encoding the full-length crylA(b) insecticidal protein from 3^ 

thurinciensis . The truncated version described above 

represents the first approximately 2 Kb of this gene. The 

remainder of the full-length gene is cloned using the 
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procedures described above. Briefly, tiiis procedure entails 
symiliesizing Did oligomers of 40 zo 90 NT in length, typically 
using 30 tziers as an average size. The oiigomers are purified 
using standard procedures of HPLC or recovery from a 
polyacrylamide gel. Purified oligomers are Icinased and 
hybridized to form fragments of about SOO bp. The hybridized 
oligomers can be amplified using ?CR under standard conditions. 
The 500 bp fragments r either directly from hybridizations ^ from 
PCS. amplification, or recovered from agarose gels after either 
hybridization or PCR. amplification, are then cloned into a 
plasmid and transformed into coli using standard procedures • 
Recombinant plasmids containing the desired inserts are 
identified, as described above, using ?CR and/ or standard 
miniscreen procedures. Inserts that appear correct based upon 
their PGR and/or restriction enzyme profile are then sequenced 
to id ent i f y those clones containing the desired open reading 
frame. The fragments are then ligated together with the 
approximately 2 Kb synthetic sequence described in Example 2 to 
produce a full-length maize optimized synthetic cryia(b) 
gene useful for expression of high levels of CrylA(b) protein 
in maize. 

G+C Content of native and synthetic St genes: 
Full-length native 38. S% 

Truncated native 37.2% 
Fuli-length synthetic 64.8% 
Truncated synthetic S4.6% 
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% homology of the final rrancarea version of tiie 3t gene 
relative co a ''pure* maize codon usage gene: 98.25% 
EXAMPLE 3 Consrrucrion of a planr expressible, full-length, 
hybrid partially maize optimized crylA(b) gene. 

pCIB4434 contains a full lengtb CrylA(b) gene 
comprised of about 2 Kb of the synthetic maize optimized 
crylA(b) gene witb the remainder (COOH terminal encoding 
portion) of the gene derived from the native gene. ThuSr the 
coding region is a chimera between the synthetic gene and the 
native gene, but the resulting protein is identical to the 
native crylA(b) protein. The synthetic region is from 
nucleotide 1-1938 (amino acids 1 to 64 6) and the native coding, 
sequence is from nucleotide 1939-3468 (amino acids 647 to 
1155) . The secpience of this gene is set forth in Fig. 7. A 
map of pCIB4434 is shown in Fig. 3. 

The following oligos were designed to maice pCIB4434: 

KE134A28 = 5' -CGTGACCGAC TACCACATCG ATCAAGTATC CAATTTAGTT 
GAGT-3' 

KE135A28 - 5' -ACTCAACTAA ATTGGATACT TGATCGATGT GGTAGTCGGTC 
ACG-3' 

XE136A28 = 5'-GCAGATCTGA GCTCTTAGGT ACCCAATAGC GTAACGT-3' 
XE137A28 = S'-GCTGATTATG CATCAGCCTAT-S' 
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KE138A2B » 5 ' -GCaGATC?GA GCTCTTATTC CTCCATAAGA AGTAATTC-3' 



MK0SA28 = 5'-CAAAGGTACC CAATAGCGTA ACG-2' 



MK35A28 = 5' -AAC3AGGTGT ACATCGACCG-3' 



pCIB4434 is made using a four-way ligation witii a 5.7 
Icb fragment from pCI34418, a 34 6 bp 3st B II\Kpn I 
PCR-generated synthetic: native fusion fragment^ a 108 bp Kpn 
I\Nsi I native CrylACb) fragment from pCIB1315, and a 224 bp 
Nsi I\Bgl II PCX-generated fragment. Standard conditions for 
ligation and transformation are as described previously. 

A synthetic: native gene fusion fragment is made in two 
steps using PCS.. The first 253 bp of the ?CR fusion fragment 
is made using 100 pmols of oligos KE134A28 and MK04A28 with 
approximately 200 ng of native cryIA{b) template in a 100 ui 
volume with 200 nm of each dNTP^ 1 X PGR buffer (PerJcin Elmer 
Cetus) r 20 % glycerol, and 5 units of Taq polymerase (PerJcia 
Elmer Cetus) . The PCS reaction is run with the following 
parameters: 1 minute at 94*Cr 1 minute at 55*Cr 45 seconds at 
72*Cr with extension 3 for 3 seconds for 25 cycles. A fraction 
(1 %) of this first PGR reaction is used as a template along 
with 200 ng of the synthetic crylA(b) DMA to make the complete 
351 bp synthetic: native fusion fragment. Oligos used as PGR 
primers in this second Pd reaction are 50 pmols of MK35A2S, SO 
pmols of MK04A28, and 25 pmols of KE135A28. The PCR reaction 
mix and parameters are the same as those listed above. The 
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resultant 3S1 bp synch ticrnacive fusion fragmenr is creaced 
with Proteinase K at 50 ug\ial total concentration and 
phenol \ chloroform extraction followed by ethanol precipitation 
before cutting with Bst S IIXKpn I using standard conditions. 

The 224 bp Nsi I\Bgl II PGR fragment used in making 
pCIB4434 is made using 100 pmols of oligos KE137A28 and 
KE138A28 and 200 ng of the native crylA(b) gene as template in 
100 ul volume with the same PGR reaction mix and parameters as 
listed above. The 230 bp PGR native crylA(b) fragment is 
treated with Proteinase K, phenol \ chloroform extracted^ and 
ethanol precipitated as described above , before cutting with 
Nsi I\Bgl II. 

pCIB4434 was transformed into maize protoplasts as 
described above. Line 6 2717 protoplasts were used with 
pCIB4434 and pCI34419 as a control for comparison. The results 
are shown below: 

ng Bt/mg protein 
4419(353) 14,400 i 2,100 

4434 (full-length) 2,200 ± 900 

Background = 13 ng Bt/mg protein for untransf ormed protoplasts 



The results indicate that ?CIB4434 expresses at a level 
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of about 15% of OCIB4419. 

Wesrern blot analysis shows at least one'-tiiird of tbe 
crylA(b) protein produced by pCIB4434 in tbis system is about 
130 JcD in size- Therefore , a significant amount of fuil-iengtb 
cryIA{b) protein is produced in maize cells from the expression 
of PC1B4434. 

SXaMPLE 7. Construction of a full-lengtb, crylACb) genes 
encoding a temperature-stable crylA(b) protein. 

Constructs ?CIBSS11-551S, each containing a 
fuil-lengtb, crylA(b) gene are described below. In these 
sequences, the 26 amino acid deletion between amino acids 793 
and 794, KCGEPNRCAPHLEWNPDLDCSCfflGS, present in crylA(a) and 
crylAtc) but not in crylAtb), has been repaired « The gene in 
PCIB5513 is synthetic; the other four genes are hybrids, and 
thus are partially maize optimized. 

Construction of pCZB5511 

This plasmid is a derivative of pCIB4434. A map of 
PCI35511 is shown in ?ig. 10* A 435 bp segment of DNA between 
bp 2165 and 2590 was constructed by hybridization of synthetic 
oligomers designed to represent the upper and lower strand as 
described above for the construction of the truncated crylA(b) 
gene. This segment of synthetic DNA is synthesized using 
standard techniques 3cnown in the art and includes the 26 amino 
acid deletion found to occur naturally in the crylACb) protein 
in Bacillus thurinaiensis )curstaki HD-1. The entire inserted 
segment of OHA uses maize optimized codon preferences to encode 
amino acids. The 26 amino acids used to repair the naturally 
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occurring deletion are conrained witiiin riiis fragment:* They are 
inserted startling at position 23B7 between zhe Kpnl site at nt 
2170 and the Xbal site at nt 2508 (2586 in pCIBSSll) of 
OCI34434. oCIBSSll is constructed via a three way ligation 
using a 2.2 Kb fragment obtained by restriction digestion of 
pCIB4434 with Sphi and Kpnl, a 3.8 Kb fragment obtained by 
digestion of pCIB4434 with SphI and Xbal, and a 416 bp fragment 
obtained by digestion of the synthetic DMA described above, 
with Kpnl and Xbal. Snzymatic reactions are carried out under 
standard conditions. After ligation, the DNA mixture is 
transformed into competent coli ceils using standard 
procedures. ?ransf ormants are selected on L'-agar containing 100 
ug/ml ampicillin, Plasmids in transf ormants are characterized 
using standard mini-screen procedures. The sequence of the 
repaired cryIA(b> gene encoding the cryIA{b) temperature (heat) 
stable protein is set forth in ?ig. 9. 

Construction of pCIB5512 
This plasmid construct is a derivative of pCIB4434. 
A map of PCI3S512 is shown in Fig. 12. DNA to repair the 26 
amino acid deletion is prepared using standard techniques of 
DNA synthesis and enzymatic reaction. Three double stranded DHA 
cassettes, pGFcasl, pGFcas2 and pGFcas3, each about 300 bp in 
size, are prepared. These cassettes are designed to contain the 
maize optimized codons while maintaining 100% amino acid 
identity with the insecticidal protein. These cassettes are 
used to replace the region between restriction site BstZII at 
oosition 1B24 and Xbal at position 2508 and include the 
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inserrion of riie addirionai 78 bp wnicii encode riie aiissing 25 
amino acids (described above for pCIBSSll in ?C134434) . Sacii of 
these casaerres is cloned inro the ScoRV site of the vector 
Bluescript (Stratagene) by standard techniques. The three 
cassettes are designed to contain overlapping restriction 
sites. Cassette 1 has restriction sites BstEII at the 5' end 
and ScofW at the 3' end: cassette 2 has ScoRV at the 5' end and 
Clal at the 3' end and cassette 3 has Clal at the 5' end and 
Xba I at the 3' end. They are cloned individually in Bluescript 
and the the complete 762 bp fragment is subsequently assembled 
by ligation using standard techniques. pCIBS512 is assembled 
using this 762 bp fragment and ligating it with a 6.65 Kb 
fragment obtained by a complete digestion of pCZB4434 with 
BstEII and a partial digestion with Xbal. Alternatively^ a four 
way ligation using the same vector and the three cassettes 
digested with the specific enzymes can be employed* Enzymatic 
reactions are carried our under standard conditions. After 
ligation, the ONA mixture is transformed into competent coli 
cells using standard procedures . Transf ormants are selected on 
L-agar containing IQO ug/ml ampicillin. Plasmids in 
transformants are characterized using standard mini-screen 
procedures. The resulting plasmid is pCIBS512. The sequence 
of the repaired crylA(b) gene is illustrated in Fig. 11. This 
repaired crylA(b) differs from that carried in pCIBSSll in that 
a larger region of the crylA(b) coding region is optimized for 
maize expression by using maize preferred codons. 

Construction of pCI355i3 
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This plasmid contains a repaired crylA(b) gene derived 
from pCIBS512« A map of ?CI3S513 is shown in Fig. 14. The 
region 3' from the Xbal site at position 2S86 to the end of the 
gene (Bglll site at position 3572) is replaced entirely with 
maire optimised codons. This region is synthesized, using 
standard techniques of DMA synthesis and enzymatic reactionr 
well Icnown in the art, as foxar double stranded DNA cassettes 
(cassettes # 4,5 ,6 ,1) . Adjacent cassettes have overlapping 
restriction sites to facilitate assembly between cassettes. 
These are Xbal and Xhol at the 5' and 3' ends of cassette 4; 
Xhol and Sad at the 5' and 3' ends, respectively, of cassette 
5; Sad and BstXI at the 5' and 3' ends, respectively, of 
cassette 6; and BstXI and Bglll at the 5' and 2' ends, 
respectively, of cassette 7. As described for pCIB5512, the 

r 

cassettes are cloned into the blunt^end BcoBV site of the 
Bluescript vector (Stratagene) and the full-length "repaired* 
crylA(b) gene cloned either by sequential assembly of the above 
cassettes in Bluescript followed by ligation of the complete 
967 bp synthetic region with a 6448 bp fragment obtained by a 
complete digestion of pCIBS512 with Bglll and a partial 
digestion with Xbal. Alternately, the plasmid containing the 
full-length genes is obtained by a 5-way ligation of each of 
the four cassettes (after cleavage with the appropriate 
enzymes) and the same vector as above. The sequence of the 
full-length, "repaired" crylA(b) gene is set forth in Fig. 13. 
The protein encoded by the various synthetic and 
synthetic/native coding region chimeras encode the same 
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protein- Tliis protein is riae heat-stable version of crylA(b) 
produced by repairing tbe naturally occurring 26 amino acid 
deletion found in tbe crylA(b) gene from Bacillus tburinqiensis 
Jcurstaici HD-1 vhen the homologotzs region is compared with 
either crylA(a) or cryia(c> Bacillus tburinqiensis 
delta-endotoxijis • 

Construction of pCIB£514 
This plasmid is a derivative of pCIB4434. .A map of 
pCIB5514 is shown in Fig. 16. It is made using synthetic DNA. 
cassette #3 (see above) which contains a maize optimized 
sequence of the region between the Clal site (position 2396) 
found in the 26 amino acid thermostable region and the Xbal 
site at position 2508 in pCIB4434 (2586 in pCIBSSll) . The 
region between at 2113 of ?CIB4434 and the junction of the 
thermostable region is ?CR amplified by using pCIB4434 as 
template with the following primers; 
forward: S'GCaCCSaaATCaCCATCCaAGGaGGCSaTGaCSTATT^ 
reverse : 

3'-Aa::GCMCSaSTC3GCrCCCCGCACTTGCCGarTG^ . 

The PCa product is then digested with restriction 
enzymes Kpnl and Clal and ligated in a four part reaction with 
a 189 bp fragment obtained by digestion of cassette 3 with Clal 
and Xbal, a 3.2 Kb fragment of pCI34434 digested with SphI and 
Kpnl, and a 3.8 Kb fragment of pCI34434 obtained by digestion 
with SphI and Xba. Enzymatic reactions are carried out under 
standard conditions. The ligation product is transformed into 
competent 3 coli cells, selected with aaroicillin and screened 

■ 
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using standard procedures described above. The sequence of the 
repaired crylA(b) gene contained in pCiaS514 is shown in Fig. 
15. 

Construcrion of pClBSSlS 
OCIB4434 was modified by adding the 78bp Geiser 
thermostable element (Geiser TSE) , described above, between 
the Kpn I site (2170 bp) and the Xba I site (2508 bp) in the 
native BtJc region. The exact insertion site starts at the 
nucleotide #2379. The region containing the Geiser TSE was 
amplified by two sets of PCS reactions r i.e. the Kpn I - Geiser 
TSE fragment and the Geiser TSE - Xba I fragment. 

PGR primer#l: (Kpn I site) 

5' - ATTACGTTAC GCTATTGGGT ACCTTTGaXG - 3' 

PCR primeri2: (CSeiser TSE bottom) 

5/ - TCCCCGTCCC TGCAGCTGCA GTCTAGGTCC GGGTTCCaCT 
CCAGGTGCGG AGCSCATCGA TTCGGCTCCC CGCACTTGCC 
G&7TG&AC7T GGGGC7GA - 3' 

PGR primerf3: (Geiser TSE top) 

5' - CaAGTGCGGG GAGCCGAATC GATGCGCTCC GCACCTGGaG 

TGGAACCCGG ACCTAGACTG CAGCTGCAGG GACGGGGAAA 

AATGTGCCCA TCATTCCC • 3' 



PGR primer#4: (Xba I site) 

5' - TGGTTTCTCT TCGAGAAATT CTAGATTTCC - 3' 
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After ti amplification, tiie PGR fragments were 
digested witli CKpn I -k Cla I) and (Cla I ^ Xba 1) , 
respectively. THese two fragments were ligated to tixe Kpn I 
and Xba I digested pCI34434 . T&e resulting construct pCIBSSlS 
is PC1B4434 with a Geiser rSE and an extra Cla I site flanked 
by Kpn I and Xba I. A map of pCiaSSlS is illustrated in Fig- 
SB. The cryia(b) gene contained herein, which encodes a 
temperature stable crylA(b) protein, is shown in Fig. 37. 

Esampies 9-20 set forth below are directed to the 
isolation and characterization of a pith-preferred promoter. 
EXAMPLE 9. RMA Isolation and Northern Blots 

All RMA was isolated from plants grown tinder greenhouse 
conditions. Total RNA was isolated as described in Kramer er 
al.. Plant Physiol,, 90:1214-1220 (1990) from the following 
tissues of Funk maize line SN984: 3, 11, IS, 25, 35, 40, and SO 
day old green leaves; 8, 11, IS, 2S, 35, 39, 46, SO and 70 day 
old pith; SO and 70 day old brace roots from Funk maize line 
5N984; 60 and 70 day 5N9a4 sheath and ear stock. HNA was also 
isolated from 14 day 211D roots and from developing seed at 
weeicly intervals for weeks one through five post-pollenation. 
Poly A+HNA was isolated using oligo-dT as described by 
Sambrook et al.r Molecular Cloning: A Laboratory Manual (2nd 
ed.), 1989, and Northern blots were carried out, also as per 
Sambrook et al. using either total RMA (30 ug) or poly A-f- RMA 
(2-10 ug) . After electrophoresis r RNA was blotted onto 
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Nitroplus 2000 membranes (Micron Separarions Inc) . Tlie ?NA was 
linked the filter using the Stratalinker (Straragene) at 0.2 
mJoules. The northerns were probed with the 1200 bp ScoRI pith 
(TRpA) 8-2 cDNA fragment, isolated by using 0.8% low melting 
temperature agarose in a T3E buffer system. Northerns were 
hybridized and washed and the filters exposed to film as 
described in Isolation of cDNA clones. 

SXftMPLE 10. Isolation of cDNA Clones 

First strand cDNA synthesis was carried out using the 
3RL AMV reverse transcriptase system I using conditions 
specified by the supplier (Life Technologies, Inc., 
Gaithersburg, MD) . Specifically, 2S \xl reactions containing 50 
mM Tris-HCl pH 8,3, 20 mM KCl, 1 mM DTT, 6 mM MgC12, 1 mM each 
of each dNTP, 0.1 mM oligo (dT) 12-18, 2 ug pith poly(A+) RNA, 
100 yg/ml BSAr 50 pg/ml actinomycin D, 8 units placental RNase 
inhibitor, 1 ul (10 mM Ci/ml) 32P dCTP >3000 mCi/mM as tracer, 
and 20 units AMV reverse tr<mscriptase were incubated at 42 •€ 
for 30 min. Additional KCl was added to a concentration of 50 
mM and incubation continued a further 30 min. at 42 •C. KCl was 
added again to yield a final concentration of 100 mM. 
Additional AMV reverse transcriptase reaction buffer was added 
to maintain starting concentrations of the other components 
plus an additional 10 units, and the incubation continued at 
42 for another 30 min. Second strand isynthesis was completed 
using the Riboclone cDNA synthesis system with Eco RI linkers 
(Pr omega, Madison, WL) . Double stranded cDNA was sized on an 
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1% agarose gel using Tris-borats-SDTA buffer as disclosed ia 
Sambrooic ec al.« and snowed an a^rerage size of abour 1.2 Kb* 
The cDNA was size f racrionated using HA45 DEAE membrane so as 
to recain tbose molecules of abouc IQOQ bp or larger using 
conditions specified by tibe supplier (Srhleicber and ScHuell) . 
Size fracrionaced cOMA was ligaced into tbe Lambda ZapII vector 
(Stratagene^ La Jolla^ CA) and packaged into laabda particles 
using Gigapacic II Plus (Stratagene, La Jolla^ CA) . The 
unamplified library had a titer of 215 r 000 pfu while the 
amplified library had a titer of 3,5 billion/ml using PLK-?' 
cells . 

Recombinant phage were plated at a density of 5000 pfu 
on 150 X I5am L*agar plates. A total of 50 r 000 phage were 
screened using duplicate lifts from each plate and probes of 
first strand cDNA generated from either pith derived mRNA or 
seed derived mRNA. The lifts were done as described in 
Sambrooic et al. using nitrocellulose filters. DNA was fixed to 
the filters by OV crosslinJcing using a StratalinJcer 
(Stratagene, La Jolla, CA) at 0*2 oJoule. Prehybridization and 
hybridization of the filter were carried out in a solution of 
lOX Denhardts solutionr 150 u9/ml sheared salmon sperm DNA, 1% 
SDSr SO znM sodium phosphate pE 7, 5 mM SDTA, SX SSC, 0.05% 
sodium pyrophosphate. Prehybridization was at 62 for 4 hours 
and hybridization was at 62 for 18 hours (overnight) with 1 
million cpm/ml in a volume of 40 ml. Filters were washed in 
500 ml of 2X SSC, 0.5% SDS at room temperature for IS min. then 
at S3«C in O.IX SSC, 0.5% SDS for 30 min. for each wash. 

100 



wo 93/07278 



PCr/US92/08476 



Radiolabeled DNA probes were made using a BRL random prime 
labeling sysrem and unincorporarsd coiincs removed using Miclc 
Columns (Pharmacia) . Filters were exposed ovemigiit to Kodak 
X-Omat AR X-ray film with (DtiPonr) Cronex Lighcning Plus 
intensifying screens at -80*C. Plaques showing hybridization 
with the oith-derived nrobe and not the seed-derived probe were 
plaque purified for further characterization. 

EXAMPLE 11. Isolation of Genomic Clones 

Genoxaic ONA from Funk inbred maize line 2110 was 
isolated as described by Shure et al., Call , 35:22S--233 (1988). 
The DNA was partially digested with Sau 3A and subsequently 
size fractionated on 10-40% sucrose gradients centrifuged in a 
Beckman SW40 rotor at 22,000 rpm for 20 hours at 20»C. 
Fractions in the range of 9-23 Kb were pooled and ethanol 
precipitated. Lambda Dash II (Stratagene) cut with Bam HI was 
used as described by the supplier. The library was screened 
unamplified and a total of 300,000 pfu were screened using the 
conditions described . above . The library was probed using pith- 
specific (TrpA) CDMA clone 3-2, pCI3S600 which was identified 
in the differential screen of the cDNA library. Isolated 
clones were plaque purified and a large scale phage preparation 
was made using Lambdascrb (Pr omega) as described by the 
supplier. Isolated genomic clones were digested with Sco RI 
and the 4.8 kb ScoRI fragment was subcloned into Bluescript 
vector (Stratagene) . 
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EXAMPLE 12. DNA Sequence and Compucer Analysis 

Nucleotide sequencing was perrormed using zhe dideoxy 
diain-ceraination metiiiod disclosed in Sanger et al.r 
PNAS r 74 :5463-5467 (1377). Sequencing primers were synciiesised 
on an Applied aiosyscents model 3 8 OB ONA SYnthBsLzer using 
srandard conditions. Sequencing reactions were carried out 
using the Seqpienase system (US Biochemical Corp.) . Sel 
analysis was performed on 40 cm gels of S% polyacrylamide with 
7 M urea in Tris-Borate-SDTA buffer (BRL Gel-Mix 6) . Analysis 
of sequences cind comparison with sequences in Senfianic were done 
using the U. of Wisconsin Genetic Computer Group Sequence 
Analysis Software (UWGCG) • 

SXAMPLS 13, Mapping the Transcriptional Start Site 

?rimer extension was carried according to the procedure 
of Metraux et al., PNAS , 86; 896-900 (1988). Briefly, 30 ug of 
maize pith total RNA were annealed with the primer in 50 mH 
Tris pfl 7.S, 40 mM KCl, 3 mM MgC12 (RT buffer) by heating to 
80 *C for 10 minutes and slow cooling to 42*C. The RNA/primer 
mix was allowed to hybridize overnight. Additional RT buffer, 
DTT to 6 mM, 3SA to 0.1 mg/ml, RNAsin at 4 tl/ml and dUTP's at 1 
mH each were added. Then 8 units AMV reverse transcriptase were 
added and reaction placed at 37 *C for one hour. The primer used 
was 5'-CCGTTCGTTC CTCCTTCGTC GAGG-3S which starts at +90 bp 
relative to the transcription start. See Fig. 29A. A 
sequencing ladder using the same primer as in the primer 
extension reaction was generated using the 4.8 Kb genomic clone 
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to allow determinacion of zhe transcriptional start site. The 
sequencing reaction was carried out as described in Exan^le 12. 

RMase protection was used to determine if the the 371 
bp sequence from +2 bp to -^373 bp (start of cDNA) was 
contiguous or if it contained one or more introns. A 385 bp 
SphI*NcoI fragment spanning -^2 bp to -^387 bp relative to 
transcriptional start see ?ig. 29B was cloned into p6SH*5Zf (-*-) 
(Promega) and transcribed using the Riboprobe Gemini system 
(Promega) from the SP6 promoter to generate radioactive 
antisense RMA probes as described by the supplier. RMase 
protection was carried out as described in SambrooJc et al. 
pBR322 (cut with Hpall and end labelled with 32P-dC7P) and 
Klenow fragment were used molecular weight marJcers . Gels were 
S% acrylamide/7M iirea (BRL Gel-Mix 6) and were run at 60 watts 
constant power. 

SXAMPLZ 14, Genomic Southern Blots 

Genomic DNA. was isolated from maize line 211D using the 
procedure of Shure et al.r supra. 3 ug of genomic OlUl were 
used for each restriction enzyme digest. The following enzymes 
were used in the buffer suggested by the supplier: BamHI, 
ScoRI, EcoKV, Hindlllr and Sad. Pith cDNA clone number B-2 
was used for estimating gene copy number. The digested ONA was 
run on a 0.7% agarose gel using Tris-3orate-SDTA buffer system. 
The gel was pretreated with 250 mM HCl for IS min. to 
facilitate transfer of high molecular weight DNA. The DNA was 
transferred to Nitroplus 2000 membrane and subsequently probed 
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wit:*! ZhB pith cDNA 8-2. 
in Example 10. 

SXAMPLS 15, PCa Material and Metliods 

PCa reactions were pref oraied using the GeneAmp DNA 
Amplication reagent kit and An^liTaq recombinant Taq DNA 
polmerase (Pericin Slmer Cetus) . Reaction condition were as 
follows: 0.1 to 0.5 uM of each of the two primers used per 
reaction r 25 ng of the pith 4.8 Kb ScoRI fragment in 
Bluescriptr plus the PGR reaction mis described by the supplier 
for a total volume of 50 uL in 0.5 mL GeneAmp reaction tube 
{Per.^in Slmer Catus) . The DNA Thermal Cycler (Pericin Slmer 
Catus) using the Step-Cycle program set to denature at 94 for 
60 Sr anneal at 55 *C for SO a, and extend at 72 *C for 45 s 
followed by a 3-s-per-cycle- extension for a total of 30 cycles. 
The following primer sets were used: I. 83 X 84^ -429 bp to -2 
bp; II. 49 X 73, -€9 bp to +91 bp; III. 38 X 41, +136 bp to 
+258 bp; and IV. 40 X 75r +239 bp to +372 bp. These are marked 
on Fig. 24. 

SXAMPLS 16. Isolation of a Pith-Preferred Gene. 

A. cONA library derived from pith mRNA cloned into 
Lambda Zap and screened using first strand cDMA derived from 
either pith or seed mRNA. Clones which hybridized with only 
the pith probe were plaque purified and again screened. Clones 
passing the second screen were used as probes in northern blots 
containing RKA from various maize tissues. 
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SXAMPT'g^ 17. Gen Stracrure and Sequence Analysis. 

The 1.2 Kb inserr of the cDNA clone 8-2 was sequenced 
using the dideoxy method of Sanger ec al., supra. Likewise r 
rhe genomic equivalent contained on a 4.8 2Qd ScoRl fragment 
Bluescript denoted as pCIBSfiOl, was sequenced. This 
information revealed that the genomic copy of the coding region 
spans 1.7 Kb and contains five introns. The mRNA transcript 
represents six exons. This is shown in Fig. 24. The exons 
range in size from 43 bp to 313 bp and the introns vary in size 
from 76 bp to 130 bp. The entire sequence of the gene and its 
corresponding deduced amino acid sequence are shown in Fig. 24. 

This gene encodes a protein of 346 ami no acids with a 
molecular mass of about 38 ]cD. As illustrated in Table 1, the 
predicted protein shows 52% similarity and 41% identity with 
the subunit protein of Pseudomonas aeruginosa and has high 
homology with trpA proteins from other organisms. 

Table 1 

Conservation of TrpA sequences between a maize TrpA gene and 
other organisms. 

% amino acid % amino acid 

Similarity Identity 

Haloferax volancii 56.4 36.1 

Methanococcus voltae 58.1 35.1 

Pseudomonas aeruginosa 62.5 41.8 

Neurosoora crassa 61.4 39.3 

Saccharomvcas cerevisiae 56.7 36.1 
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Similariry groupings, I=L*M=V, D=E, F-Y, K-R, N«Q/ S=T 

Similarities and indentities were done using zhe GAP program 
from UWGC3. 

Crawford et al., Ann. Rev. Microtoiol> , 43:567-o00 
(1989) , incorporaced herein by reference, found regions of 
conserved amino acids in bacrerial rrpA genes. These are amino 
acids 49 to 58, amino acids 181 to 184, and amino acids 213 to 
216, with the rest of the gene showing greater variability than 
is seen in the TrpB sequence. An alignment of Icnown trpA 
proteins with the maize TrpA protein (not shown) illustrates 
that the homology between the maise gene and other trpA 
laroteins is considerable. Also, it is comparable to the level 
of homology observed when other TrpA proteins are compared to 
each other as described in Crawford et al,, supra. 

To determine the location of the transcription start 
site and whether or not there were introns present in this 
region, four polymerase chain reaction (PCR) generated 
fragments of about 122 bp to 427 bp from the region -429 bp to 
-j-372 bp were used for northern analysis. The results of the 
northerns sbowed that PCa probes II, III, IV hybridized to pith 
total RMA and PCR probe I did not hybridize. This indicated 
that the transcription start was in the -59 bp to +90 bp 
region. To more precisely locate the transcriptional start 
site, primer extension was employed. Fig. 29A shows that when 
a primer (#73) located at +90 bp relative to the 
transcriptional start is used for primer extension, the 
transcriptional start sire is located at +1, 1726 bp on the 
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genomic sequence. 

Tlie first ATG from the tirauiscriptional starr site is at 
-rll4 bp. This is tlie ATG that would be expected to serve as 
the site for translational initiation. This ATG begins an open 
reading that runs into the open reading frame found in the cDNA 
clone. The first 60 amino acids of this predicted open reading 
frame strongly resemble a chloroplast transit peptide. See 
aerlyn et al. ?WAS , 86:4604-4608 (1989) and Neumann-Karlin et 
al., 5MB0 5:9-13 (1986). This result suggests that this 
protein is targeted to a plasrid and is likely processed to 
yield the active protein. Transient expression assays in a 
maize mesophyll protoplast system using a maize optimized B.t. 
gene driven by the trpA promoter showed that when the ATG at 
+114 bp is used as the fusion point, the highest levels of 
expression are obtained. Using either of the next two ATGs in 
the sequence substantially reduces the level of expression of 
the reporter gene. The ATG at -^390 bp gave some activity, but 
at a much lower level than the h*ii4 ATG, and the ATG at 4*201 bp 
gave no activity. 

Athough a number of TATA like boxes are located 
upstream of the upstream of the transcriptional start site at 
H-l bp, the TATAAT at -122 bp is most like the plant consensus 
of TATAAA. See Joshi, Nuc . Acids Res. , 15:6643-6653 (1987). 
The presumptive CCAAT like box was found at -231 bp . The 
nucleotide sequence surrounding the ATG start (GCGACATGGC) has 
homology to other maize translation starts as described in 
Messing et al., Genetic Snaineerinc of Plants: An Agricultural 
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PersTPecrlve . Plenum Press, ?p. 211-227 (1983), but differs from 
that considered a consensus sequence in plants (ANNATGGC) . 
See , Joshi, above. The presumptive poly (A) addition signal is 
located at 3719 bp (AATAAA) on the genomic sequence, 52 bp from 
the end of the cDNA. The sequence matches known sequences for 
maize as described in Dean et al., Nuc, Acid s Res., 
14:2229-2240 (1986), and is located 346 bp downstream from the 
end of protein translation. See Dean et al., Nue, Acids Res., 
14:2229-2240 (1986) . The 3' untranslated sequence of the cDNA 
ends at 3775 bp on the genomic sequence. 

Fig. 28 shows a Southern blot of maize 211D genomic DMA 
with the approximate gene copy number as reconstructed using 
pith gene 8-2 cDNA. From the restriction digests and 
reconstruction there appear to be 1-2 copies of the gene 
present per haploid genome. There do not appear to be other 
genes with lower levels of homology with this gene.. Therefore, 
rh-f <? represents a unique or small member gene family in maize. 

3XAMPLS 13. RNase Protection 

The structure of the 5' end of the mRNA was determined 
using RNase protection. The RNase protection was carried out 
using a probe representing 385 nt from +2 bp to +387 bp- This 
region from the genomic clone was placed in the RNA 
transcription vector p(5EM-52f (+) and a 32P labelled RNA probe 
generated using S?6 polymerase. The probe and the extra bases 
from the multiple cloning site produce a transcript of 461 nt. 
The probe was hybridized with total pith RNA and subsequently 
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digested with a mixture of RNase A and Tl and til protected 
fragments analyzed on denaturing poiyacryiamide gels. Analysis 
of the gels shows a protected fragment of about 255 nt and 
another fragment of about 160 nt. See Fig, 29B. 

The fact that primer extension using a primer (#73) at 
+80 op produces a product of 90 NT in length argues that the 
5' end of the transcript is located at position +1 bp. Primer 
extension from a primer in this region produces a product, so 
one would expect this also to be detected by the RNase 
protection assay. This primer is located in the 5' region of 
the RNase protection probe. The cDNA clone contains sequences 
present in the 3' end of the RNase protection probe and hence 
were expected to be protected in this assay. Since only one 
band is present on the gel which could account for both of 
these sequences r we are confident that the protected fra gment 
is indeed the larger band and that the smaller single band is 
an artifact. If there were an intron in this region, fragments 
from each end would be present in the probe, and hence would be 
detectable on the gel. Of the two bands seen, one of them 
appears to represent the entire 5' region, therefore we do not 
believe that there is an intron located in this region. 

SXAMPLS 19. Complementation of coli TrpA Mutant with the 
Pith cDNA 3-2 

a. coli strain CSSC strain 5531 from the E. coli 
Genetic Stock Center, Yale University (O.H. Smith lab strain 
designation, #MS004) with chromosomal maricers glnA3, 'TrpAd825, 
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i-, IN(rmD-rraB) , rlii-l as described in Mayer er ai., Mol. Gen. 
Gentet., 137;131-142 (1975), was transforaed witii eitiier tiie 
pitii (TRpA) cDNA 8-2 or Sluescript piasmid (Stracagene) as 
described in Sambrooic et ai., supra. Tbe transf oraanrs 
containing tbe TrpA cDHA 8-2 had the ability to grow without 
the presence of tryptophan on miniTnal medium whereas the 
tiransformants with the Bluescript (Stratagene) piasmid or 
untransformed control were not able to grow without tryptophan. 
The ceils transformed with the maize TrpA gene grew very slowly 
with colonies visible after seven days gro%rt:h at room 
temperature. All strains were grown on M9 minimal me di um 
supplemented with 200 ug/ml glutamine, 0.01 ug/ml thiamine and 
with or without 20 ug/ml tryptophan. All transf ormants were 
checked for the presence of the appropriate pia smi d by 
restriction enzyme analysis. Colonies growing in the absence 
of tryptophan all contained clone 8-2 containi n g the cDNA for 
the putative maize TrpA gene, as confirmed by Southern 
hybridization (data not shown) . These results support the 
conclusion that this is the maize tryptophan synthase subunit A 
protein . 

SXAMPLE 20. Gene Expression 

The expression pattern of the pith-preferential gene 
throughout the plant was examined. Different maize genotypes 
were also examined for partems of expression of this gene. 
The following tissues were used as the source of HNA for these 
studies: upper r middle, and lower pith, brace roots, ear 
shank, cob in genotype 5N984; upper, middle, lower pith, 10 day 
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Old leaves r 14 day old roots and pith from the entire plant in 
genotype ZllD^ and seed from genotype 211D which had been 
harvested at weekly intervals one to five weeks 
post -pollination. Lower pith is derived from, i.e. constitutes 
the two intemodes above hrace roots; middle pith is derived 
from the next three intemodes; upper pith represents the last 
two intemodes before the tassel in 60 and 70 day plants. Only 
two intemodes were present in 39 day old plants and three 
intemodes for 46 day old plants. Northem blot analysis shows 
that transcripts hybridising with a probe derived from the pith 
cDNA accumulate rapidly in young pith and young leaf. As the 
age of the plant increases and one moves up the stalk, there is 
a significant decrease in the amount of transcript detected. 
See Figs- 2SA-D. At no time is message from this gene detected 
in seed derived RKA, either total RNA or poly A+ RNA. See Fig, 
26. Transcript is also detected in root, earshank, and sheath 
but not at the high levels detected in the pith and yoting leaf 
tissues. See Figs. 253, 25C. Some message is detected in 
brace roots, but only at a very low level. See Fig. 2SD. Six 
maize undifferentiated callxis lines were analyzed by northem 
blot analysis and no expression was found for this gene (data 
not shown) in any callus sample. The level of expression of 
this gene is extremely high since a very strong signal to a 
probe from TrpA gene 8-2 can be detected in pith and leaf as 
little as two hours after exposure of the blot to film (Fig. 
25A) . The amoxint of mRNA made is comparable to that derived 
from the maize phosphoenolpyruvate carboxylase gene disclosed 
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in Hudspecii et al,, ?lanr Mel, 3ioloaV / 12:S79--589 (1989), 
anotlier ixigiily expressed aiaise gene. Hudspetiii is incorporared 

herein by reference. 

The expression pattern of this gene is not teinporally 
constant. Sxpression is very high in the lower and aiiddle pith 
of plants less than 60 days old and decreases rapidly near the 
top of the plant. As the plant reaches maturity, e.g. over 70 
days oldr the expression drops to nearly xmdetectable levels 
except in the lower pith and earshanlc . The accumulation of 
transcript in young leaf is nearly as high as that seen in 
lower pith but expression decreases rapidly and is undetectable 
in leaves over 40 days of age. Sxpression in leaf was found to 
be variable depending on the season when it is grown. 

Sxanples 21*39 set forth below are directed to the 
isolation, characterization and expression analysis of a 
pollen-specific promoter according to the present invention* 

Identification of pollens-specific proteins 
Bxample 21. Maise Plant Growth 

Maize plants (Zea mays ?uziJc inbred 21 ID) were grown 
from seed in a vezmiculite/sand mixture in a greenhouse under a 
16 hour light/ 8 hour dark regime. 
Example 22. Total Pollen Protein Isolation 

Mature pollen was isolated from maize plants at the 
time of maximum pollen shed. It was sieved to remove debris, 
frozen in liquid nitrogen, and a 3*4 ml volume of frozen pollen 
was ground in a mortar and pestle with an equal volume of 
75-150 urn glass beads. 40 ml of grinding buffer (2mM SDTAr 5mM 
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DTTr 0.1% SDS, 100 3kM Hepes pH 3) was added and mixture was 
ground again. The glass beads and intacr pollen grains were 
pelleted by low speed cenrrifugarionr and mixture was clarified 
by cenrrifugarion at 10,000 g for 15 zoinures. Protein was 
precipitated from tiie supernatant by addition of acetone to 
90%. 

Example 23. Pollen Sxine Protein Isolation 

Sxine Protein was isolated from maize 2110 shed pollen 
as described in Matousek and Tupy, J. , Plant Physiology 
119:169-178 (1985) . 
Example 24. Leaf Protein Isolation 

Young leaves (about 60% expanded) were cut from the 
maize plant the midrib removed. Total protein was isolated as 
for pollen, except that the material was not frozen and 
grinding was in a Waring blender without glass beads. 
Example 25. Kernel Protein Isolation 

Ears with fully developed, but still moist kernels were 
removed from the plant and the Icemels cut off with a scalpel. 
Total protein was isolated as for leaves. 
Example 25. Gel Electrophoresis of Maize Proteins 

Pollen, leaf and kernel proteins were separated on SDS 
polyacrylamide gels as described in SambrooJc et al. Molecular 
Cloninor A Laboratory Manual. Cold Spring Harbor Laboratory 
Press: New Yor3c (1989) , Following staining by Coomasie blue, 
protein bands from pollen, leaf and kernel were compared and 
abundant proteins of approximately 10 kDrl3 kD, 20 kD, 45 kD, 
55 kD and 57 kD were determined to be pollen specific. 
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idenrif icarion of Polien*3peci.f ic cDNA clones 
Sxample 27. Parrial Sequence Dersraination of ?ollen*Specific 



Prorein bands determined to be pollen**specxfxc were 
purified by elecrroblocting from tiie polyacryiamide gel onto 
PVDF membrane (Matsudaira, P., J. 3iol, Chem, 261:10035-10038 
(1987) ) or by reverse phase HPLC. N- terminal sequence of tbe 
purified proteins was determined by automated Sdman egradation 
witb. an Applied Biosystems 470A gas*pbase sequencer. 
Phenylthiobydantoin (PTH) amino acids were identified using an 
Applied Biosystems 120A PTH analyzer. To obtain internal 
sequencer proteins were digested with endoproteinase Lys~C 
(Boehringer Mannheim) in 0.1 M Tris-HCl, pfl 8.5, for 24 hours 
at room temperature using an exizyme: substrate ratio of 1:10. 
Resulting peptides were isolated by HELC using an Aquapore C-8 
column eluted with a linear acetonitrile/isopropanol (1:1 
ratio) gradient (0 to S0%) in 0.1% TFA« Sequence of isolated 
Lys^C peptides was determined as above. The following 
sequences were determined for the 13kD pollen-specific protein: 



H-teratinus : TTPLTFQVGKGSKPGHLILTPNVATI 



LysC SI 
LysC 54 
LysC 49 
LysC 43 



KPGBLILTPNVATISDWTK 
SGSTRIADDVIPADFK 
SHGGDDFSFTLK 
EGPTGTWTLDTX 



Sxample 28 . 



Synthesis of Oligonucleotide Probes for 
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Pollen-Specific cONAs 

Regions of peptide sequence in the 13kD protein vritii 

low codon redundancy were selected^ and suitable 
oligonucleotide probes for the gene encoding these regions were 
synthesized on an Applied Biosy stems 3 BOA synthesizer. The 
following oligonucleotides were synthesized: 

Oligo #51 5'-AA ATC ATC ACC ACC ATG TTC-3' 

6 G G G C 

T 
C 

Oligo # 58 S'-CC TTT ACC CAC TTG AAA-3' 

C G C G 

T 

where the columns of nucleotides represent bases that were 
incorporated randomly in equal proportions at the indicated 
position in the oligo. Oligo #51 encodes the amino acid 
sequence SHGSDDF foimd in peptide LysC 49, and Oligo #58 
encodes the amino acid sequence FQVGKG found in peptide 
N-terminus. Use of these mixed oligonucleotides to screen a 
cDNA library for the pollen-specific gene will be described 
below. 

Sxample 29. Construction of a maize pollen cONA library 
Total maize RNA from maize 211D shed pollen was 
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isoiared as described in Glisen ec ai, 3iochemxst:rv 
13:2633-2637 (1974) . Poly A+ mRNA was purified from total 3MA 
as described in Saobrooic er al. Using tbis mBNA.r cDNA. was 
prepared using a cDNA synrbesis 3cit purcbased from Promega, 
following protocols stipplied witb tbe kit. The ScoRI linkers 
were added to tbe cONA and it was ligated into arms of tbe 
cloning vector lambda Zap^ pxircbased from Stratagene and using 
tbe protocol supplied by tbe manufacturer. Tbe ligation 
product was pacicaged in a lambda pacJcaging extract also 
purcbased from Stratagene, and used to infect coli BB4 
cells . 

Sxample 30. Isolation of pollen-^specific cDNA clones 

Tbe maize pollen cDNA library was probed using tbe 
syntbetic oligonucleotides probes specific for tbe I3kD protein 
gene^ as described in Sambrook et al. Briefly, about 100 r 000 
pbage plaques of tbe pollen cDNA library were plated and lifted 
to nitrocellulose filters. Tbe filters were probed using 
oligonucleotides #51 and #58 wbicb had been 32P end-labeled 
using polynucleotide ]cinase. Tbe probes were hybridised to 
tbe filters at low stringency (50 degrees C in IM NaClr 10% 
dextran sulfate, 0.5% SDS) , washed 30 minutes at room 
temperature and then 30 minutes at 45 degrees C in oX SSC, 0.1% 
SDS, and exposed to X-ray film to identify positive clones. 
Putative clones were purified through four rounds of plaque 
hybridization. Three classes of cDNA clones were isolated. 
Type I contained ScoRI fragments of 0.2 kfa and 1,8 Icb. Type II 
contained ScoRI fragments of O.S )cb, 0,5 )cb and 1.0 Icb, and 
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Type III contained an EcoRI fragmenr of 2.3 3cb. 

Sxampie 31. Characterization of Pollen-specific cDNA clones 

The BcoRI fragments of Zh Type II cONA clone were 
subcloned into the plasmid vector pflluescript SK+, purchased 
from Stratagene. See Fig. 30. The 0.5 jcfa fragment in 
pflluescript was named II-.S, the 0.5 jcb fragment in pBluescript 
was named II-. 5 (later renamed pCIB3169) and the 1.0 Jcb 
fragment in pBluescript was named II -1.0 (later renamed 
PCIB31S8) . As will be described below, the 0.5 kb and 1.0 Icb 
fragments encode the maize pollen-specific CDPK gene. HNA from 
anthers, pollen, leaf, root and silk was denatured with 
glyoxal, electrophoresed on a 1% agarose gel, transferred to 
nitrocellulose, and probed separately with the three ScoRI 
fragments that had been labeled with 32P by random primer 
extension as described in Sambrook et al. Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press: New 
York (1989) • The blots were exposed to X-ray film, and an. mRNA 
band of approximately 1.5 kb was identified with the 0.6 kb 
fragment probe, while the 0.5 and 1.0 kb fragments hybridized 
to an approximately 2.0 kb aRNA. In all cases hybridization 
was only seen in the pollen RHA lane, with the exception that 
the 0.6 kb fragment showed a slight signal in anther mRNA. The 
conclusion from these data was that the original cDNA clone was 
a fusion cDNA molecules derived from two different mRNAs. The 
0.6 kb fragment was a partial cDNA of a i.S kb pollen- specific 
mRNA, and this mRNA encodes the peptides LysC 49 and 
N-terainus. The 1.0 and 0.5 kb fragments comprise a partial 
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cDNA. of a 2-0 Icb pollen-specific loRNA unrelaced co tie peptides 
and oligonudeoride prob s used for probes. This conclusion was 
verified when the fragments were sequenced using title did e oxy 
n^^si'in termination method as described in SambrooJc et al. The 
cD£UL sequence is shown in Fig. 31. 

Example 32 • Determination of specificity of mRNA expression 

To determine if the 2.0 ]cb RIA represented by cONA 
clones pCIB3169 and pCIB31sa were present only in pollen, total 
HIZA was isolated from maize 211D roots r leaves y pollen, an t h e rs 
or silks. The RNAs were denatured with glyoxal, 
electrophoresed on a 1% agarose gel/ transferred to 
nitrocellulose, and probed with 32?-labeled ScoRZ insert from 
plasmid pCIB31ffS or pCIB3169r all using standard techniques as 
described in SambrooJc et al. Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Laboratory ?ress: New York (1989) • 
Exposure of this blot to photographic film demonstrates that 
the gene represented by these two clones is only 
transcriptionally active in the pollen (Fig. 32) . 

Identification of a Pollen-Specific Promoter 
Example 33. Construction of a Maize Genomic DSA Library 
Genomic DNS. from maize line 211D young shoots was 
isolated as described in Shxire et 1, Call 35:225-233 (1983) . 
The DNA was provided to Stratagene, where a genomic DNA library 
was constructed by cloning Sau3AI partially digested DNA into 
Stratagene's Lambda Oash cloning vector. 

Example 34. Genomic DNft. Blot Hybridization to Determine Gene 
Copy Number. 
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Genomic DNA from maize line 211D was digested with a 
niimber of restriction enzymes ^ tiie individual digests 
electrophoresed on an agarose gel« transferred to 
nitrocellulose, and probed with 32P-labeied ScoRI insert from 
plasmid pCia3168 (1.0 kb fragment), pCIB3169 (0.5 }cb fragment) 
or clone II-. 5 using standard techniques described in Sambrook 
et al. More than 10 bands were detected by the II-. S probe on 
most digests, indicating that this cDNA is derived from a 
large, multigene family. Probing with the 1.0 kb fragment 
detected from 3 to 6 bands, and probing with the 0.5 kb 
fragment detected only from 1 to 3 bands which were a subset of 
those detected by the 1.0 kb fragment. Due to the smaller gene 
family size detected by the 1.0 kb and 0.5 kb fragments, it was 
decided to attempt to isolate the genomic clone corresponding 
to them. 

Sxanple 35. Isolation of a pollen-specific genomic clone 

The Stratagene maize 211D genomic library was screened 
by probing plaque lifts with 32P labeled insezrts from plasmid 
PCIB3168 (1.0 kb fragment) and pCI33169 (G.S kb fragment) using 
standard procedures as described in the Stratagene manual 
accompanying the library. Using this strategy. Lambda clone 
MG14 was isolated, and it hybridized to both probes. The 9.0 
kb 3amHI fragment of MG14, which also hybridized to both 
probes, was subdoned into the 3amHI site of pBluescript SK+ to 
create plasmid pCI3379. 1800 bp of pCia379, in the region 
corresponding to the cONA sequence, was sequenced a^ described 
above. Co mp a r ison of the cDNA and genomic sequences showed 
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only 91% identity. pCIS379 ijiserr repr sents a relarsd 
pollen-specific crene. 

A second maize 211D genomic library was construcred in 
the veccor laxobda SSM-llr ptirehased from Promega^ using tHe 
procedures described in tbe Promega manual. Screening tbis 
un->amplified library as above yielded clone GEMll-1, wbicb 
hybridized to botb O.S and 1.0 5cb probes. The 20Icb Hindlll 
fragment of GBMll-1, wbicb also hybridized to both probes, was 
subcloned into the Hindlll site of pBluescript SK+ to yield 
pCIB31Sg. The DN& sequence of 4.1}cb of pCIB3166 was determined 
(Fig. 36) and after accounting for six introns in the genomic 
doner was 100% identical to the cSNA sequence of pCIB3168 and 
PCIB3169. CoB^arison of the pCQ3166 sequence to the 
Genbank/SHBL database revealed that the 5' port i on r through the 
3 exour was 34.6% identical to rat calmodulin-dependent protein 
]cinase II at the amino acid level (Fig. 33), while the fourth 
through seventh exons were 39.4% identical to rar calmodulin. 
See Fig. 34. No other pollen-specific kinase has been 
described, and at the time this a protein combining ]cinase and 
calmodulin domains was unicnown. Subsequently, Harper et al.. 
Science 252:951-954 (1991) have disclosed the cDNA sequence of 
a si m ila r protein from soybean r although this gene is not 
pollen-specific in expression. Comparison of the soybean 
calcium-Dependent Protein Kinase (CTPK) and the maise pollen 
C3PK reveals 38% identity at the amino acid level. See Fig. 
35. 

Example 36. Identification of the Promoter's Transcriptional 

120 



wo 93/07278 PCr/US92/08476 

Starr Site by Primer Sxtension 

Oligonucleotide ?E51f with the following sequence was 
synthesized as a primer* 

5' -TGGCZCATGGCTGCGGCSGGSAACSAG7GCSGC-3' 

Primer extension analysis was carried out on polyA*r- 
pollen mRNA as described in Metraux et al«, PNAS USA 86:896-890 
(1989) . The transcription initiation site was determined to be 
between bases 1415 and 1425 on the partial sequence of pCI33166 
shown in Fig. 36. 

Testing Promoter Function in Transgenic Plants 

Example 37. Construction of promoter vectors for plant 
transformation 

To demonstrate that the pollen CDPK promoter can drive 

expression of a linked gene in transgenic plants r a gene fusion 

of the pollen CDPK promoter to the Beta-glucuronidase gene of 

coli was constructed as follows. The lOkb BamHI fragment 

from lambda GSMll-i containing the first exon and part of nh ft 

first intron of the pollen CDPK gene plus 9 )cb upstream of the 

gene was subcloned into the BamHI site of pBluescript SK+ to 

create plasmid ?CIB3167. The 2.3 kb BamHI -Hindlll fragment 

from PCIB3167 was subcloned into the BamHI and Hindlll sites of 

pBluescript SK+ to create plasmid pSKlOS. The pSKlOS was 

digested with Aval and Hindlll, and thel.75 kb Hindlll-Aval 

fragment was isolated on an agarose gel. A PCR reaction was 

run under standard conditions as described in Sambrook et al. 

using intact pSKlOS as a template and the following primers: 

#42 : 5' -AGCGGTCGACCTGCAGGCaTGCGATCTGCACCTCCCGCCG-3' 
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#43: 5'-ATGGGCAAGGaGCTCSGG-3 

The PCI reaction produces were digesced wirii Aval and 
Sail and the restilting fragment: isolated on an agarose gel. 
pBluescript SK+ was digested witii Hindlll and Sail. The 1.75 
kb Hindlll-Aval fragment/ PCH derived Aval -Sail fragment ^ and 
pBluescript vector with Hindlll and Sail ends were ligated in a 
three way ligation to create plasmid pSKllO. 

A fusion of the promoter fragment in pSKllO to the 
Beta-*glucuronidase (GQS) gene was created by digesting pSKllO 
with Hindlll and Sail, isolating the l.Skb fragment on an 
agarose gel and ligating it into Hindlll and Sail sites of 
PCIB30S4, to create plasmid pKL2, a plasmid derived from paC19 
containing the GUS gene followed by plant intron from the maize 
PEPC gene and a polyA signal from cauliflower mosaic virus. 
This promoter fusion was inactive in plants, probably due to 
the presence of out of frame ATG codons in the leader sequence 
preceding the GUS gene ATG. 

A function fusion of the promoter was created by 
digesting ?KL2 with Xbal and Sail to remove the previous fusion 
jtmction. A new fusion junction was produced in a PCR reaction 
using pSKlOS as a template and the following primers: 
#SKSO : 5' -CCCTTCAAAATCrAGAAACCT-3 ' 
#SK49 : 5' -TAATGTCSACGAACSGCGAGAGA7GGA-3 ' 

The PCR product was digested with Xbal and Sail and 
purified on an agarose gel. The purified fragment was ligated 
into the Xbal and Sail sites of pKL2 to created plasmid 
PCIB3171. This plasmid contains a functional fusion of pollen 
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CDPK promoter and GDS which direcrs expression Zh GUS gene 
exclusively in pollen. 

To create a vector containing the pollen CDPK 
promoter-GDS fusion suitable for use in Aorobacterium 
tumefaciens- mediated plant transf ormationr the fusion gene was 
isolated from ?CIB3171 by digestion with Hindlll and Sail. The 
resulting fragment was ligated into the Eindlll and Sail sites 
of pBIlOl (purchased from Clontech) to create plasmid pCIB3175. 
Example 38. Production of Transgenic Plants 

PCIB3175 was transformed into Agrobacteriua tumefaciens 
containing the helper plasmid pCIBS42, and the resulting 
culture used to transform leaf dislcs from tobacco shoot tip 
cultures as described by Eorsch et al.. Science 227:1229-1231 
(1985) except that nurse cultures were omitted and selection 
was on 100 mg/1 kanamycin. Transgenic plants were regenerated 
and verified for presence of the transgene by PCS.. 
Example 39. GUS Gene Expression Analysis 

Pollen from primary transf ormants and their progeny 
were analyzed histochemically for expression of the GUS gene as 
described by Guerrero et al., Mol. Gen. Genet. 224:161-168 
(1990) . The percentage of pollen grains expressing the GUS 
gene, as demonstrated by blue staining in the X-gluc buffer, is 
shown in the table below. 



Plant Number % Blue Pollen 

PPl-51 28% 

PPl-54 54% 

PPl-55 none 

PPl-61 very few 

PP1-S3 51% 
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PP1-S7 15% 
PPl-80 10% 

ppi-a3 12% 

Primary transforaants in whicii a single pollen CDPK 
promoter-GUS gene was integrated would produce a maximum 50% 
GtJS positive pollen due to segregation of the single gene. 

Flouometric GUS assays were done on pollen , stem, root, 
leaf and pistil tissue of selected plants to demonstrate tiie 
specificity of pollen CDPK promoter expression. Assays were 
performed as described in Jefferson, Plant Mol. Biol. 
14:995-1006 (1990), and CTS activity values are expressed as 
nmoles MD/ug protein/minute. 



Plant 


Tissue 


GUS Activity 


nntransfocme< 


i Net 


number 




Plant GUS 
Activity 


Acti 


5P1-51 


stem. 


0.01 


0.02 


0 




leaf 


0 


0 


0 




cool: 


O.IS 


0.10 


0.05 




pjgcil 


0.02 


0.01 


0.01 




pollen 


0.24 


0.02 


0.22 


PPl-54 


stem 


0.01 


0.02 


0 




leaf 


0 


0 


0 




root 


0.13 


0.1 


0.03 




pistil 


0.01 


0.01 


0 




pollen 


0.€0 


0.02 


0.5B 


PPl-€3 


stsa 


0.01 


0.02 


0 




leaf 


0 


0 


0 




root 


0.07 


0.1 


0 






0.01 


0.01 


0 




pollen 


0.57 


0.02 


0.55 



Examples 40-50- are directed primarily to the preparation of 
ciiimeric constructs, i.e. recombinant DNA. molecules, conta inin g 
constitutive, tissue-preferred, or tissue-specific promoters 
operably linked to an instant 3.t . gene, insertion of same into 
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vecrors, production of rransgenic piatins conraining tbe 
vecrorSf and analysis of expression levels of 3. t ■ proteins of 
Zh,B Transgenic plants. 

SXAMPLE 40: C3NSTBDCTI0N OF M21IZE OPTIMIZED BT 

TRANSFORMATION VECTORS 

To demonstrate tlie effectiveness of the synthetic 3t 
crylA(b) gene in maize^ tiiie PepC and pith specific promoters 
are fused to the synthetic Bt crylA(b) gene using PCR. 
Oligomers designed for the PGR fusions were: 

<PEPC) 

KE99A28 » 5 ' -TGCGGTTACC GCCGATCaCATG-3' 

KE97A28 = 5 ' *GCSSTACCSC 6TCSACSCSG ATCCCSCSGC SGGAA6CTAAG-3' 

(PITH) 

KE10QA28 » S'-GTCGTCGACC GCaACA-3' 

KE98A28 - 5' -GC3GTACCSC GTTAACGCGG ATCCTGTCCG ACACCGGaC-3' 
KE104A28 = S'-GATGTCSTCS ACCSCAACAC-3' 

KZ103A28 =» 5'-GCSGTACCGC 3GATCCTGTC CGACACCGGA CGGCT-3' 

PGR primers are designed to replace the Nco I sites in 
the 5' untranslated leader region of each of these tissue 
specific genes (containing ATG translational start sites) with 
Bam HI sites to facilitate cloning of the synthetic crylACb) 
gene into this Bam HI site. Subseauent construction of vectors 
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containing zhe tissue specific promoters fused to tiie syntlietic 
crylA(b) gene and also containing the 3SS:PAT:35S marker gene 
involves several intermediate constructs. 

1. PCIB4406 (35S:syntlxetic-cryIA(b) :?epC ivs#9:35S) 

pCIB440S contains the 2 Kb 3am ai\Cla I synthetic 
crylA(b) gene fused with the CaMV 35S promoter (Rothstein at 
al.. Gene 53:153-161 (1987)). The gene also contains intron #6 
derived from the maise PEP carboxylase gene (ivs#9) in the 3' 
untranslated region of the gene, which uses the CaMV 3' end, 
( PNAS USA , 33:2884-2888 (1986)/ Hudspeth et al., giant 
Molecular Biology , 12: 579-589 (1989)). pCIB4406 is ligated 
and transformed into the "SURE*" strain of coli cells 
(Stratagene, La Jolla, (SI) as described above. One mutation is 
found in pCIB4406's cryl&(b) gene at amino acid #436 which 
resulted in the desired Phe being changed to a Leu. pCIB4406 
is fully active against European com borer when tested in 
insect bioassays and produces a CrvIA(b) protein of the 
expected size as determined by western blot analysis. 

2. PCI34407 {2SS : synthetic-cry lA {b ) :pepC ivs#9:35S 

35S:PAT:35S) 

pCIB4407 is made from an approximately 4 Kb Hind 
IIIXEco RI fragment containing the 35S:PAT:35S gene^ and the 
3.1 KbXHind IIIXEco RI 3SS : synthetic-cry lA (b ) :3SS gene from 
PCI34406. PCI34407 is ligated and transformed into "SDRE', 
DHSalpha, and HBlOl strains of E^ coli using standard 
procedures (SambrooJc et al.). The synthetic crylA(b) gene has 
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the same properri s as its precursor pCIB4406. 

3. ?CIB4416 (35S:syntJiecic-cryIA(b) :pepC ivs#9:3SS + 

3SS:PAT:35S + 3SS:Adh incron:GUS :35S. ) 

pCIB4407 is cue witii Sco RI and created with calf 
intestinal alkaline phosphatase (CI?) under standard conditions 
(Saxnbrook et al.) to produce an about 7.2 Kb fragment that is 
ligated with a 3.4 Kb Eco RI 35S :Adh\GUS :35S fragment to 
produce pCIB4416. Ligations and transformations into "SORE" 
cells is as described above. The synthetic crylA(b) gene in 
PCI3441S has the same properties as the gene in pCIB4406. 

4. PCIB4418 ( 3SS: synthetic-cry lA (b ) :pepC ivs#9:35S) 

PCIB4406 is digested with Apa I and Bam HI and treated 
with CI?. PCIB4406 is digested with Bam HI and Nsp I. 
pBS123#13 is digested with Nsp I and Apa I. A three-way 
ligation is made consisting of a 4.3 Kb Apa I\Bam HI fragment 
from ?CIB4406r a 1.3 Kb Bam HI\Nsp I fragment from ?CIB4406r 
and a 170 bp Nsp I\Apa I fragment from pBS123#13 to form 
pCIB44ia. The host coli strain for pCIB4418 is HBIOI. 

5. PCI34419 {35S:synthetic-cryIA{b) :pepC ivs#9:35S + 

3SS:PAT:3SS + 3SS:Adh intron:GUS :3SS. ) 
PCIB441S and pCIB44ia are digested with Bst S II and 
SCO NI and fragments of pCI3441S are treated with CI?. A 9.1 
Kb fragment from pCIB441S ligated to a 1.4 Kb fragment from 
PCIB4418 to form pCIB4419. pC134419 transformed in HBlOl 
competent 3^ coli cells demonstrates full activity in insect 
bioassays against European com borer. 
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6. PCI34420 (Pitii:synt::ietic-cryIA(b) :PSPC ivs#9:35S ^ 

35S:PAT:25S) 

Inrermediare consrructs in making pCI34420 are pBTinl, 
pBtinZ, p4420A and pBtin3. pBtinl (pitiii proiaoter: second half 
of tHe synriietic Bt gene + 3SS:PAT:2SS) is made by ligating the 
2.1 Kb Xba IXNco I pith promoter fragment from plasmid 
pith (3-1) with a 5.2 Kb Xba I\Nco I fragment from pCIB4407. 
pBtin2 is an intermediate construct containing the pith 
oromoter modified with a 210 bo PGR fragment made using primers 
KE100A28 and KB38A28 listed above. The PCR reaction mix 
contains approximately 100 ng of a 2.1 Kb Bam HiVNco I pith 
promoter fragment with 100 pmol of each oligomer r 200 nM of 
each dNTP, 1 X buffer (Catus) and 2.5 units of thermal stable 
polymerase. Since the TSa is relatively low (between 40* and 
5Q*C) , PCa reactions are run with the following parameters: 

denaturation cycle: 94 for 1 minute 

annealing cycle : 37 •€ for 1 minute 

extension cycle : 72*C for 45 seconds (-t- 3 seconds per 

cycle) 

number of cycles: 25 

PCS. reactions are treated with proteinase K as 
described above prior to cutting with Sal I\Kpn I followed by 
phenol \ chloroform extraction and ethanol precipitation as 
described above. The 210 hp fragment is purified on a 2 % 
Nusieve gel and extracted from the gel using Millipore's filter 
units. The 210 bp Sal I\Kpn I fragment is ligated to the 4.9 
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Kb Sal I\Kpn I fragaenr from ?it:ii(3-l) to maice pBtin2. p4420A 
(pith isynthecic-Bt: Pep intron:25S + 25S:PAT:3SS) is made with a 
three-way ligation consisting of a TOO bp Nsi I\Bam HI fragment 
from pBtin2, a 1.8 Kb Bam HIXBst S II fragment from pCIB44iar 
and a 5.9 Kb Bst S II\Nsi I fragment from pfltinl. After p4420A 
is made three mutations are discovered in pBtin2. A second ?CR 
fragment is made to modify the Nco I site in the pith leader 
using primers KE104A28 and KE103A28 with 7m values around 65 *C. 
The ?CR reaction mix is identical to that listed above with the 
addition of glycerol to 20% to reduce mutations in rich 
areas (Henry et al., Plant Molecular Bioloov Reporter 
9 (2} :129-144, 1991). PCH parameters are as follows: 
File I: 94«C : 3 minutes , 1 cycle 
File II: S0»C : 1 minute 

94*C : 1 minute 

25 cycles 

File III: 72 : 5 minutes, 1 cycle 

PCR reactions are treated as above and cut with 
restriction endonucleases Sal I and Kpn I. The 210 bp Sal 
I\Kpn I PCSl (glycerol in the reaction) fragment is ligated to 
the 4.9 Kb Sal I\Kpn I fragment from plasmid pith (3-1) to make 
pBtin3. Sequence data on pBtin3-G#l shows this PGR generated 
fragment to be correct. 

pBtin3-G#l is used to make pCIB4420 (also called p442aB 
"G#6'') . pCI34420 is constructed with a three-way ligation 
using the 700 bp Nsi I\Bam HI fragment from pBtin3-<3#l, a 1.8 
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Kb 3am HI\Bst: S II fragmenr from ?CI34418, and a 5.9 Kb 3st S 
II\Hsi I fragment from oBtinl. ?CIB4420 is used in masopiiyil 
proroplast: experiments and demonstrat s full activity of tiie 
syntiietic crylA(b) gene against European com borer. 

7. pCIB4413 (PEPC:syntlietic-Bt (Phe mutation) :PSPC intron:35S.) 

A fusion fragment is generated by PGR using primers 
KB99A28 and KE97A28 with a 2.3 KB Hind III\Sal I template from 
oGnS4.5. The PCR Tniy contains the same concentration of 
primers, templater dNTPs, salts r and thermal stable polymerase 
as described above. PGR reaction parameters are: 

denaturation cycle : 94 •C for 1 minute 

annealing cycle : 55 for 1 minute 

extension cycle : 72 •C for 45 seconds (+ 3 seconds per 

cycle) 

number of cycles: 30 

After completion, PGR reactions are treated with 
proteinase K followed by phenol \ chloroform extraction and 
ethanol precipitation as described above prior to cutting with 
restriction endonucleases Bam HI and Bst B II. 

pCI34413 is made with a three-way ligation using the 
210 bp Bam HlXBst B II PCS fragment r a 4.7 Kb Bam HI\Hind III 
fragment from pC134406r and a 2.2 Kb Hind IIINBst S II fragment 
from pGUS4 .5. 

8. pCI34421 (PSPC : synthetic-cry lA (b) :PEPC intron:3SS.) 

PCIB4421 is made to replace the synthetic cryIA{b) gene 
containing the Phe mutation in pCI344l3 with the synthetic 
cr7lA(b) gene from pCIB4419. pCI34421 is made by ligating a 
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5.2 Kb Ban HI\Sac I fragment from pCI34413 witii a 1.9 Kb Bam 
HI\Sac I fragment from pCIB441d. 

9. pCIB4423 (PEPC:syntbetic-cryIA(b) :PepC iatron;3SS ^ 
3SS:PAT:3£S) 

The 2.4 Kb Bam HI\Hind III PEPC promoter fragment from 
PCIB4421 is ligated to the 6.2 Kb Bam HI\Hind III fragment in 
PCIB4420 to make pCIB4423. The Hind III site is deleted by 
exonucleases in the cloning of pCIB4423. pCIB4423 contains the 
synthetic crylA(b) gene under the control of the PEPC promoter , 
and the PAT gene under the control of the 3SS promoter. 

10. Synthetic cryIA{b) gene in Aarobacterium strains: 

Acrobacterium strains made with the synthetic crylA(b) 
gene allow transfer of this gene in a range of dicotyledenous 
plants. AqrQfaa<?f fty^ittn vector pCIB4417 contains the 3.3 Kb Hind 
IIIXEco RI 35S: synthetic-Cry IA(b) :PepC:ivs#9:3SS fragment from 
PCIB4406 (Phe nutation) ligated to the 14 Kb Hind IIIXEco RI 
fragment from pBIlOl (Clontech) . Using elect roporat ion r 
PCIB4417 is transferred into the A. tiimefaciens strain LBA4404 
(Diethard et al., Nucleic Acids Research . Voll7:#lS:6747, 
1989.) . 

200 ng of pCIB4417 and 40 ul of thawed on ice LBA4404 
competent cell are electroporated in a pre-cooled 0.2 cm 
electroporation cuvette (Bio-Rad Laboratories Ltd.). Using 
Gene Pulser-TM with the Pulse Controller unit (Bio^Rad) , an 
electric pulse is applied immediately with the voltage set at 
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2.5 JcV, and the capacity ser ar 25 uF. After tiie pulse, ceils 
are immediately rransf erred to 1 ml of YSB medium and shaken at 
27 C for 3 hours before plating 10 ul on ABminrKmSO plates. 
After incubating at 28 C for approximately SO hours colonies 
are selected for miniscreen preparation to do restriction 
enzyme analysis. The final Aorobacrerium strain is called 
pCIB4417:LB&4404. 




The presence of the crylA(b) toxin protein is detected 
by utilizing enzyme-linked immunosorbent assay (ELISA) . SLISAS 
are very sensitive, specific assays for antigenic material. 
SLISA assays are useful to determine the expression of 



produced in response to immunizing rabbits with 
gradient-purified 3t crystals [Ang et al.. Applied Snviron. 
Microbiol. . 3S:S25-€2S {1978)3 solubilized with sodium dodecyl 
sulfate. ELISA analysis of extracts from transiently 
transformed maize cells is carried out using standard 
procedures (see for example HarloWf E., and Lane, 0. in 
"Antibodies: A Laboratory Manual*/ Cold Spring Harbor 
Laboratory Press, 1988) . SLISA techniques are further 
described in Clark et al.. Methods in Snzvmoloay , 118:742-7Sfi 
(1986); and 3radf ordr Anal. 3iochem. , 72:248 (1976). Thus, 
these procedures are well^jcnown to those skilled in the art. 
The disclosure of these references is hereby incorporated 
herein by reference. 

SLISA assays are performed to detect the production of 
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CrylA(b) protein in aiaiz protopiasrs, Prorein produced is 
reporred below as ng of Bt per mg zotai prorein (ng 3t/mg) . 
Sach consrrucr was tesred rwics. 
pCIB3069 No dacacrable Bt (both rests) 
pCI34407 21^900 ng Bt/mg roral prorein^ 

21^000 ng Bt/mg total protein 
The transformed maize cells produce high levels, on rhe 
order of approximately 20,000 ng of Bt CrylA(b) protein per mg 
total soluble protein, of the Bt I? when transformed with rhe 
maize optimized Bt gene. The level of detection of these SLISA 
based assays is about 1 to 5 ng CrylA(b) protein per mg 
protein. Therefore, the maize optimized Bt gene produces as 
much as approximately a 20,000 fold increase in expression of 
this protein in maize cells. 

3XAMPLE 42: ASSAY OF SXTRACT FROM TRANSFORMED PROTOPLASTS FOR 

INSSCTICIDAL ACTIVITY AGAINST SUROPSAN CORN BORER 

Western blot analysis is also performed using extracts 
obtained from maize cells which had been transiently 
transformed with ONA to express the maize optimized gene. When 
examined by western blots, this protein appears identical with 
the protein produced in coll . In contrast, as demonstrated 
in Sxample 6 above, no detectable Bt crylA(b) insecricidal 
protein is produced by maize cells transformed with comparable 
vectors arrempting to express the native Bt derived coding 
region . 

Qualitative insect roxiciry tesring can be carried out 
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using harvested protopiasrs • Suspensions are prepared for each 
replicate tesred in all bioassays. A replicate is considered 
positive if it causes significantly higher mortality than the 
controls. For example, replicates are tested for their 
activity against insects in the order Leoidootera by using the 
European com borer, Qstririia nubilalis. One-hundred ul of a 
protoplast suspension in 0.1% Triton X-100 is pipetted onto the 
surface of artificial Black cutworm diet, (Bioserv, Inc., 
Frenchtown, NJ; F9240) in 50 nm X 10 nna snap-cap petri dishes. 
After air drying 10 neonatal larvae are added to each plate. 
Mortality is recorded after about 4 days- When this protein is 
fed to European com borers, it produces 100% mortality. 

SXaMPIS 43: EXPHESSION OF SYNTHETIC 3T IH MAIZE MESOPHYLL 

PROTOPLASTS 

The general procedure for the isolation of com 
mesophyll protoplasts is adapted from Sheen et al., The Plant 
Call, 2:1027-1038 (1990) . The protoplast transformation system 
used in Sheen et al. is modified by using PSG mediated 
transformation, rather than electroporation. That procedure, 
as well as changes made in the isolation procedure, is 
described below. 

Maize Mesophyll Protoplast Isolation/Transformation 

1. Sterilize and germinate com seeds for leaf 
material. Seedlings are grown in the light at 25C. 

2. Surface sterilize leaf pieces of 10*12 day old 
seedlings with 5% Clorox for 5 minutes followed by several 
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washes wirh srerile discilled warer. 

3. Aliquor enzyme soiurion (see recipe below); 25 
ml/dish (100x25 nm perri disli) . 

4. Remove any excess warer from leaves and place 2 
inch pieces in each dish of enzyme. 14 plates are usually set 
up with the leaf material from about 100 seedlings. 

5. Cut leaves in longitudinal strips as thin as 
possible (2-5 mm) . 

6. Shake slowly at 2SC for S.S to 7 hours. Cover plates 
so that incubation takes place in the dark. 

7. Before filtering protoplasts, wash 100 um sieves 
with 10 ml 0.6 M mannitol. Pipet protoplasts slowly through 
sieves. Wash plates with O.S M mannitol to ^ther any 
protoplasts left in the dishes. 

8. Pipet filtered liquid carefully into 50 ml sterile 
tubes. Add equal volumes of O.S M mannitol to dilute. 

9. Spin for 10 minutes at 1000 rpm/500 g in table-top 

« 

centrifuge (Beckman Model TJ-6) . 

10 . Remove enzyme solution and discard. Resuspend 
pellets carefully in 5 ml mannitol. Pool several pellets. Bring 
volume to 50 ml with O.S M mannitol and spin. 

11. Resuspend to a known volume (50 ml) and count. 

12. After counting and pelleting, resuspend protoplasts 
at 2 million/ml in resuspending buffer (recipe below) . Allow 
opts to incubate in the resuspending buffer for at least 30 min 
before transformation. 

Transformation : 
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1. Aliquot piasmids to rutses (Fisherbrand polystyrene 
17 X 100 mm Snap Cap culture tubes) ; at least three replicates 
per treatment; use equimolar amounts of piasmids so tiiat equal 
gene copy numbers are compared. 

2. Add 0.5 ml protoplasts and 0.5 ml 40% PEG made with 

0.6 Mmannitol. 

3. Shake gently to mix and incubate at 25C for 30 min. 

4. Add protoplast culture media at 5 min intervals: 

1,2^5 ml 

5. Spin for 10 min at 1000 rpm/500 g. 

0. Remove liquid from pellet and resuspend in 1 ml 
culture media IBWT media) 

7. Incubate overnignt at 25C in tbe dark. 



Snzyme Solution 
0.6 H mannitol 
10 mM MBSr pfl 5.7 
1 mU CaCLj 
1 mM HgCl2 
0.1% BSA 
filter^sterilize 



To tliis solutiour add tbe following enzymes 
1% Callulase RS, and 0.1% Macerozyme RIO 



Wasb Buffer: O.S M mannitol, filter^sterilize 
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Resuspending Buffer: 0.5 M maimirol^ 20 mM KCl^ 
filter-sreriiize 

Culture Media: BMV media recipe from: 

Qlcuno ez al., Phvtioparholoav S7:S10-Si5 (1977). 

0 . o M mannitol 

4 mM MES, pH 5.7 

0. 2 mM KH-PO- 

2 4 

1 mM KNO^ 

1 mM MgSO^ 

10 mM CaCl^ 

IX K3 micronutrients 

filter-sterilize 

SLISA analysis of transformed protoplasms is done one 
day after transforaarion. SLISA' s are done as previously 
described. The following three experiments are done with maize 
inbred line 211D. Of course, other lines of maize may be used. 
50 ug of plasmid pCZB441d and equimolar amounts of other 
plasmids are used. Total soluble protein is determined using 
the BioRad protein assay. (Bradford, Anal.Biocfaem . 72:248 
(1976) . 

Transformation Sroeriment : 
Constructs tested: 

1. ?CI34413 (Construct contains synthetic* Bt under control of 
CaMV 2SS promoter and 35S/PAT and 25S/Gas marker genes) 

2. pCI34420 (Construct contains synthetic Bt under control of 
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Pitii promoter and PAT aiarXer gene) 

3. OCIB4421 (Construct contains syntixetic at under control of 



4 . pcIB4423 (Construct contains syntlietic Bt under control of 
PEPC promoter and PAT marker gene) 

(PEPC:syntiietic-cryIA(b) :PepC intron:35S + 35S:PAT:35S) 



In the following experiments^ 10 or 11 day old 211D 
seedlings are analyzed for production of tlie Bt CrylACb) 
protein in the Biorad protein assay: 



Bxperiment 1 (11 day seedlings) : 



pCZB4419 15,000 ± 3,000 ng Bt/mg protein 

PCIB4420 280 ± 65 ng Bt/mg protein 

DCIB4421 9r000 ± 800 ng Bt/mg protein 



Sxperiment 2 (10 day seedlings) : 



pCIB4419 SfOOO ± 270 ng Bt/mg protein 

PCIB4420 80 ± 14 ng Bt/mg protein 

DCI34421 1,600 ± 220 ng Bt/mg protein 



Experiment 3 (11 day seedlings) : 



PCIB4419 



21,500 ± 1,800 ng Bt/mg protein 
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OCIB4420 
OCI54421 
DCIB4423 



260 ± 50 ng 3t/mg prorsin 
11,900 ± 4,000 ng 3t/mg prorein 
7,200 ± 3,400 ng 3t/mg protein 



The above experiments confirm tihat botii tiie CaHV 3SS 
and ?EPC promoters express the synthetic 3t CrylA(b) protein ai 
very high levels. The pith promoter, while less efficient, is 
also effective for the expression of synthetic CrylACb) 
protein . 



SX&MPLE 44: STABLE SXPRESSION OF SYNTHETIC 3T IN LZTTUCS 

The synthetic 3t gene in the Aqrobacterium vector 
PCIB4417 is transformed into Lactuca sativa cv. Redprize 
(lettuce) . The transformation procediire used is described in 
Enomoto et al.. Plant Call Reports , 9:6-9 (1990). 
Transformation procedure: 

Lettuce seeds are suface sterilized in 5% Clorox for 5 
minutes followed by several washes in sterile distilled water. 
Surface-sterilized seeds are plated on half strength MS media 
(Murashige and Skoog, Phvsiol. Plant . 15:473-497 (1962)). 
Cotyledons of S-day-old Redprize seedli n gs, grown under 
illumination of 3,000 Ix 16 hr at 2SC, are used as the explants 



for Aorobacterium i 



tion. The base and tip of each 



cotyledon are removed with a scalpel. 



explants are soaked 
for 10 minutes in the bacterial solution which have been 
cultured, for 48 hours in AB minimal media with the apropriate 
ics at 28C. After blotting excess bacterial solution 
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on 



sterile filter paper, tiie expiants are placed on MS m e dia 



(0.1 mg/l 3A and 0.1 mg/l NAA) for 2 days. Sxplants are 
transferred to selective media containing 500 mg/l 



■ I 



and 50 mg/l Icanamycin. The explants are 
sufacultured to fresii media weekly. The growtli chamb er 
conditions are 16 hour 2,000 Ix light at 25C. After 
approximately 4 weeJcs, an SLZSA is done on healthy looicing 
callus from each of four plates being subcultured. The SLISA 
procedure is the same as described above for protoplasts; 
soluble protein is again determined by the 3iorad assay 



Results : 



pCIB3021 (kan control) 

PC1B4417 (plate 1) 

PCIB4417 (plate 2) 

PCI34417 (plate 3) 

PCIB4417 (plate 4) 



505 ng 3t/mg protein 
45 ng Bt/mg protein 
1,200 ng Bt/mg protein 



This example demonstrates that dicot plants can also show 
increased expression of the optimized insecticidal gene. 
Sxample 45. Construction of pCZB4429. 

PCIS4429 contains a preferred maize pollen->specif ic 
promoter fused with the maize optimized crylA(b) gene. The 
pollen-specific maize promoter used in this construct was 
obtained from the plasmid pKL2, described in Sxas^le 37. The 
maize optimized crylA(b) gene was obtained from plasmid 
PCI3441S, also described in Sxample 37. 

pKL2 is a plasmid that contains a preferred maize 
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pollen-specific promorer fused wirh the coii 

beta-glucuronidase gene. It was canstrucred from plasmids 

pSKllO and ?CIB3054. pSKllO contains tiie pollen specific maize 

oromorer. oCIB30S4r a oDClS derivative, contains tiie coll 
* * * 

beta-glucuronidase (GUS) gene fused witii tHe cauliflower mosaic 
virus (CaMV) 35S promoter. It's construction is described 
elsewhere in this application. This promoter can be removed 
from this plasmid by cutting with Sall/Hindlll to yield a 
fragment containing the GOS gene, a bacterial ampicillin 
resistance gene and a ColEI origin of replicanion. A second 
fragment contains the CaMV 35S promoter. 

pCIB3054 was cut with the restriction enzymes Sail and 
HindZII, using standard conditions, for 2 hours at room 
temperature. The reaction was then extracted with 
phenol/ chloroform using standard conditions and the DIUL 
recovered by ethanol precipitation using standard conditions. 
The recovered Dim was resuspended in buffer appropriate for 
reaction with calf intestinal alkaline phosphatase (GIF) and 
reacted with 2.S units of CI? at 37 •€ overnight- After the CIP 
reaction^ the DHA was purified on an agarose gel using standard 
conditions described elsewhere in this application. pSKIlQ was 
cut with Sall/BindZII under standard conditions for 2 hours at 
room temperature and the DNA siifasequently purified on an 
agarose gel using standard conditions . The recovered DNA- 
fragments were ligated using standard conditions for two hours 
at room temperature and subsequently transformed into cosraetent 
S. coii strain HBlOl calls using standard conditions. 
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Trans formanrs were selecred on L-agar containing 100 iig 
ampiciiiin/mi . Transf ormants were characrerized for zhe desired 
plasmid cons'Crucr using standard plas&id mini^screen 
procedures* The correcr consrruct was named pKZ«2. 

To maice pCIB44Z9, a three way ligation was performed 
using standard conditions knotm to those in the art. The three 
fragments ligated were: 

1) a Hindlll/BamHI fragment from pCrB4418, of about 4.7 
kb in sizer containing the crylACb) gene, the bacterial 
ampicillin resistance gene, and the Col£I origin of replication 

2) a Hindlll/Xbal fragment from pKL2 of about 1.3 Icb in 
size and containing the pollen specific promoter from maize 

3) a PCR generated fragment derived from the pollen 
promoter with a 3amHI site introduced downstream from the start 
of transcription. This fragment is approximately 120 bp and 
has ends cut with the restriction enzymes Xbal/BamHI. 

The SCSI fragment was generated using a 100 ul reaction 
volume and standard conditions described above. The primers 
used were: 

SKSO: S'-CCC TTC AAA. ATC TAG AAA CCT-3' 

XE127: 5'-GC6 GAT CCG GC7 GCS GCS GGG AAC GA-3' 

The above primers were mixed in a PGR reaction with 
plasmid pSKlOS, a plasmid that contains the pollen specific 
promoter from maize. 

Af^er the PGR reaction was complete , 10 ul of the 
reaction was run on an agarose gel, using standard condition, 
to maJce sure the reaction produced the expected size product. 
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The remaining 90 ul was treated with proteinase K at a final 
concentration of 50 ug/nii for 30 aiin. at 37 •C, The reaction was 
then heated at 65*C for 10 niin.r then phenol/chloroform 
extracted using srandard procedures. The ONA was recovered froo 
the supernatant by precipitating with two volumes of ethanol 
using standard conditions. After precipitation, the ONA was 
recovered by centrifuging in a microfuge. The pellet was rinsed 
one time with 70% ethanol (as is standard in the art) , briefly 
dried to remove all ethanol, and the pellet resuspended in 17 
ul T£ buffer. 2 ul of lOX restriction enzyme buffer was added 
as were 0.5 ul BamHI and 0.5 ui Xbal. The DMA was digested for 
1 hour at 37 *C to produce a DNA fragment cut with Xbal/Bamfil. 
After digestion with the restriction enzymes, this fragment was 
purified on an agarose gel composed of 2% NuSieve (FMC} /1% 
agarose gel. Millipore filter units were used to elute the DNA 
from the agarose using the manufacturer' s specifications . After 
elution, the OKA was used in the three-way ligation described 
above . 

After ligation r the DMA was transformed into competent 
B. coli strain HBlOl cells using standard techniques. 
Transformants were selected on L**agar plates containing 
ampicillin at 100 ug/ml. Colonies that grew under selective 
conditions were characterized for plasmid inserts using 
techniques standard in the art. 



EXAMPLE 46. Construction of pCI34431, a vector for tissue 
specific expression of the synthetic crylA(b) gene in plants 

pCIB4431 is a vector designed to transform maize. It 
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conrains cwo ciumeric Bt endccoacin genes expressible in maize. 
These genes are the PEP carhoxyiase promorer/syntiietic-crylACb) 
and a pollen pramoter/synrliet:ic-cryIA(b) • The PEP 
carboxylase/ cry IA(b) gene in this vecror is derived from 
PCIB4421 described above. The pollen promoter is also described 
above. Fig. 20 is a map of plasmid pCIB4431- pCIB4431 was 
construcred via a three parr ligation using the about 3.5 Kb 
Xpn I /Hind III fragment (containing 

pollen/synthetic-cry IA(b) from pCIB4429r the about 4.S Kb Hind 
III/Eco RI (PEPC/syntheric-cryIA(b) and the about 2.6 Kb Kpn 
I/Eco RI fragment from the vector Bluescript. 

Other vectors including the pollen promoter/ synthetic 
CrylA(b) chimeric gene include pCIB4428 and pCIB4430. See ?igs. 
21 and 22- pCIB4430 also contains the ?EEC/synthetic-Bt gene 
described above. 

SSCaHPLE 47. Production of transgenic maise plants contadLning 
the synthetic maise optimized CrylA(b) gene 

The example below utilises Biolistics to introduce DIA 

coated particles into maize, cells, from which transformed 

plants are generated. 

Experiment KC«€5 

Production of transgenic maize plants expressing the synthetic 

crylA(b) gene using a tissue-specific promoter. 

Tissue 

Immature maize embryoSf approximately 1.5-2.5 mm in lengthy 
were excised from an ear of genotype 6N615 14-15 days after 
pollination. The mother plant was grown in the greenhouse. 
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Before excision, the ear was surface srerilized witii 20% Clorox 
for 20 minures and rinse 3 times with sterile water. 
Individual embryos were plated scutellum side in a 2 cm square 
area, 3 6 embryos to a plate, on the callus initiation medium, 
2DG4 r- 5 chloramben medium {N6 major salts, B5 minor salts, MS 
iron, 2% sucrose, with 5 mg/1 chloramben, 20 mg/1 glucose, and 
10 ml G4 additions (Table 1) added after autoclaving. 

TABLE 1 " G4 Additions 



Ingredient oer liter medium 

Casein hydrolysate 0.5 gm 

Proline 1.38 gm 

Nicotinic acid .2 mg 

?yridoxine-HCl .2 mg 

Thiamine-'HCl , 5 mg 

Choline-HCl . 1 mg 

Riboflavin .05 mg 

Biotin . 1 mg 

Folic acid ,05 mg 

Ca pantothenate .1 mg 

p-aminobenzoic acid . 05 mg 

312 .136 ug 



Bombardment 

Tissue was bombarded using the PDS-lOOOHe Biolistics 
device. The tissue was placed on the shelf 8 cm below the 
stopping screen shelf. The tissue was shot one time with the 
DNA/gold microcarrier solution, 10 ul dried onto the 
macrocarrier . The stopping screen used was hand punched at 
ABRU using 10x10 stainless steel mesh screen. Rupture discs of 
1550 psi value were used. After bombardment, the embryos were 
cultured in the dark at 25* C. 



Preparation of DNA for deliverv 
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The microcarri r was prepared essentially according ro 
the insrrucrions supplied wim riie Bioliscic device. While 
vorrexing 50 ul 1.0 u gold microcarrier, added 5 ul pCI34431 
(1.23 ug/ul) [#898] + 2 ul nCIB3064 0,895 ug/uD [#456] 

followed by 50 ul 2.5 M CaClj. then 20 ul 0.1 M speraiidine 
(free base, TC grade) . The resulting mixture was vorrexed 3 
minutes and microfuged for iO sec. The supernatant was removed 
and the icrocarriers washed 2 times with 250 ul of 100% StOH 
(HPLC grade) by vorrexing briefly, centrifuging and removing 
the supernatant:- The microcarriers are resuspended in 65 ul 

100% StOE. 

Callus formation 

Smbryos were transferred to callus initiation medium 
with 3 mg/1 PPT 1 day after bombardment. Smbryos were scored 
for callus initiation at 2 and 3 weeks after bombardment. Any 
responses were transferred to callus maintenance medium ^ 2DG4 
0.5 medium with 3 mg/L PPT. Callus maintenance medium is 

N6 major salts, B5 minor salts r iron, 2% sucrose r with 0.5 
mg/1 20 mg/1 glucose, and 10 ml G4 additions added after 

autoclaving. Embryogenic callus was subcultured every 2 weeks 
to fresh maintenance medium containing 3 mg/L PPT. All callus 
was incubated in the dark at 25 *C. 

The Type I callus formation response was 15%. Every 
embryo which produced callus was cultured as cui individual 
event giving rise to an individual line. 
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Reaenerarion 

After 12 weeics on selecrion, the tissue was removed 
from callus maintenance edium witli ??T and was placed on 
regeneration medium. Regeneration medium is 0.25MS3S5BA (0.25 
mg/1 2r4 D, 5 mg/1 BAP, MS salts, 2% sucrose) for 2 weeics 
followed by subculture to MS3S medium for regeneration of 
plants. After 4 to 10 weeks r plants were removed and put into 
GA 7's. Our line KCS5 0-6, which became the #176 BT event, 
produced a total of 38 plants. 

Assays 

All plants, as they became established in the GA7's, 
were tested by the chlorophenol red (CR) test for resistance to 
PPT as described in U.S. Patent Application 07/759,243, filed 
September 13, 1991, the relevant portions of which are hereby 
incorporated herein by reference. This assay utilizes a pH 
sensitive indicator dye to show which cells are growing in the 
presence of PPT. Ceils which grow produce a pH change in the 
media and turn the indicator yellow (from red) . Plants 
expressing the resistance gene to PPT are easily seen in this 
test. (#176 =» 8 positive/30 negative) Plants positive by the 
ca test were assayed by PGR for the presence of the synthetic 
BT gene. (#176 « 5 positive/ 2 negative/ 1 dead) 

Plants positive by PCR for the syn-3T gene were sent to 
the phytotron. Once established in the phytotron, they were 
characterized using insect bioassays and ELISA analysis. 
Plants were insect bioassayed using a standard European Corn 
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Borer assay (described in Example 5A) in which small pieces of 
leaf of clipped from a plant and placed in a small pecri dish 
with a number of SC3 neonate larvae. Plants are typically 
assayed at a height of about 6 inches. Plants showing 100% 
mortality to EC3 in this assay are characterized further. SLISA 
data are shown below. Positive plants are moved to the 
greenhouse . 

Greenhouse / Fertility 

Plant number #175-11 was pollinated with wild-type 
SNSIS pollen. One tassel ear and one ear shoot were produced. 
All of the embryos from the tassel ear (11) and 56 kernels from 
Ear 1 were rescued. 294 ]cemels remained on the ear and dried 
down naturally. 

Pollen from #176-11 was outcrossed to various maize 
genotypes 5N9B4, 5NA89, and 3N961. Embryos have been rescued 
from all 3 outcrosses (5N984 = 45; 5NA89 « 30; 3N961 = 8) . 
Mosr of the kernels remained on the ears on the plants in the 
greenhouse and were dried down naturally. 

DNA was isolated from plane #176-11 using standard techniques 
and analysed by Southern blot analysis • It was found to 
contain sequences which hybridize with probes generated from 
the synthetic cryIA<b) gene and with a probe generated from the 
PAT gene. These results showed integration of these genes into 
the genome of maize. 



Experiment KC-64 
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Producrion of transgenic aaize plants expressing tiie synriienic 
crylA(b) gene using a consriturive promoter. 



Tissue 

Immature maize embryos, approximately 1.5-2.5 mm in 
lengtiif were excised from an ear of genotype 0N6IS 14-15 days 
after pollination. The mother plant was grown in the 
greenhouse. Before excision , the ear was surface sterilized 
with 20% Clorox for 20 minutes and rinse 3 times with sterile 
water. Individual embryos were plated scutellum side in a 2 cm 
square area, 36 embryos to a plate, on the callus initiation 
medium, 2DG4 ^ 5 chloramben medium (N6 major salts, B5 minor 
salts, MS iron, 2% sucrose, with 5 mg/1 chloramben, 20 mg/1 
glucose, and 10 ml G4 additions Table 1) added after 
autoclaving. 

1 -"g4 Additions 



Ingredient per liter medium 

Casein hydrolysate 0.5 gm 

Proline 1.38 gm 

Nicotinic acid .2 mg 

Pyridoxine-HCl . 2 mg 

Thiamine-HCl . 5 mg 

Choline-HCl . 1 mg 

Riboflavin .05 mg 

Biotin . 1 mg 

Folic acid .05 mg 

Ca pantothenate .1 mg 

p-aminobenzoic acid . OS mg 

.312 .136 ug 

Bombardment 

Tissue was bombarded using the PDS-lOOQHe Biolistics 
device. The tissue was placed on the shelf 8 cm below the 
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stopping screen shelf. The tissue was shot one time with the 
DNA/goid microcarrier solution^ 10 ^1 dried onto the 
macrocarrier. The stopping screen used was hand punched at 
ABRU using 10x10 stainless steel mesh screen. Rupture discs o 
1S50 psi value were used. After bombardment, the embryos were 
cultured in the dark at 25 • C. 



Preparation of DMA for delivery 

The microcarrier was prepared essentially according to 
the instructions supplied with the Biolistic device. While 
vortexing 50 ul 1.0 u gold microcarrier, added 3.2 ul pCIB4418 
(0.85 pg/ul) [#905] + 2 ul PCIB3064 0.895 ug/uD C#45S] + 

1.0 ul PCIB3007A (1.7 ug/ul) [#152] followed by 50 ul 2.5 M 
CaCl2/ then 20 ul 0.1 M spermidine (free base, TC grade) . The 
resulting mixture was vortexed 3 minutes and microfuged for 10 
sec. The supernatant was removed and the micro carriers washed 
2 times with 250 ul of 100% EtOH (HPLC grade) by vortexing 
briefly, centrifuging and removing the supernatant. The 
microcarriers are resuspended in S5 ul 100% StOH. 

Callus formation 

Embryos were transferred to callus initiation medium 
with 3 mg/1 PPT 1 day after bombardment. Embryos were scored 
for callus initiation at 2 and 3 weeks after bombardment. Any 
responses were transferred to callus maintenance medium, 2DG4 + 
O.S 2,4-D medium with 3 mg/L PPT. Callus maintenance medium is 
N6 major salts, 35 minor salts, MS iron, 2% sucrose, with 0.5 
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mg/1 20 mg/1 glucose, and 10 ml GA additions added aftier 

ucoclaving. Smbryogenic callus was subcultured every 2 weeJcs 
■zo fresh maintenance medium containing 3 mg/L PPT. All callus 
was incubated in tlie darJc at 2S*C. 

rne Type I callus formation response was 18%. Every 
embryo which produced callus was cultured as an individual 
event giving rise to an individual line. 

Regeneration 

After 12 weeks on selection, the tissue was removed 
from callus maintenance medium with PPT and was placed on 
regeneration medium and incubated at 25 • C using a 16 hour 
light (50 u£ .ia-2 - s-l) / 3 hour dark photoperiod. 
Regeneration medium is 0.25MS3SSBA (0.25 mg/1 2,4 D, 5 mg/1 
BAP, MS salts r 3% sucrose) for 2 weeks followed by sxibculture 
to MS2S medium for regeneration of plants. After 4 to 10 
weeks, plants were removed and put into GA 7's. . Our line KCS4 
0-1, which became the #170 BT event, produced 55 plants. Our 
line KCS4 0-7, which became the #171 BT event, produced a total 
of 33 plants. 

Assays 

Eleven plants, as they became established in the GA7's, 
were tasted by the chlorophenol red iCR) test for resistance to 
PPT as per Shiliito, et al, above. This assay utilizes a pfl 
sensitive indicator dye to show which cells . are growing in the 
presence of PPT, Cells which grow produce a pfl change in the 
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aiedia and t^im rh *» indicator yellow (from red) . Planrs 
expressing the resistance gene ro PPT are easily seen in zhLs 
tssr. Plants positive by the CR test were assayed by PGR for 
the presence of the synthetic BT gene. (Event 170 - 37 
positive/18 negative; #171 » 25 positive/8 negative) . 

Plants positive by PGR for the syn-at gene were sent to 
the phytotron. Once established in the phytotron, they were 
characterised using insect bioassays and SLISA analysis. 
Plants were insect bioassayed using a standard European com 
borer assay (see below) in which small pieces of leaf of 
clipped from a lant and placed in a small petri dish with a 
number of SC3 neonate larvae. Plants are typically assayed at a 
height of about 6 inches. Plants showing 100% mortality to ECB 
in this assay are characterized further. ELZSA data are shown 
below. Positive plants are moved to the greenhouse. 

3asta screening 

Sight of the mature plants from the #170 event were 
selected for evaluation of Basta [Hoechst] resistance. On one 
middle leaf per plant, an area approximately 10-14 cm long X 
the leaf width was painted with 0, 0.4, 1.0 or 2.0% (10 ml of 
200 g/L diluted to 100 ml with deionized water) aqueous Basta 
containing 2 drops of Tween 20/100 ml. Two plants were tested 
per level. Sight wild-*type SN615 plants of the same 
approximate age were treated as controls. All plants were 
observed at 4 and 7 days. All of the control plants eventually 
died. Throughout the study, none of the #170 plants displayed 

152 



,WO 93/07278 

4 



S92/08476 



any damage due zo zhe heraicide. 

Pollinarion 

All tassel ears, first ear and, if available, the 
second ear on the #170 and #171 plants v^ere pollinated with 
wild-type SN61S pollen. At least 90% of the plants were female 
fertile. 

Pollen from #171 plants was outcrossed to genotypes 
6N61S, SN984, 5NA89, ePOlO, 5NA56, 2N217AF, 2ND01 and 3N961* 
At least 90% of the plants were shown to be male fertile. 

Smbrvo Rescue 

Smbryos from the #171 event have been "rescued." 
Fourteen to IS days after pollination, the ear tip with 25-50 
}cernels was cut from the ear with a coping saw. Prior to 
cutting, the husics were gently peeled away to expose the upper 
portion of the ear. The cut end of the ear on the plant was 
painted with Captan fungicide and the husics replaced. The seed 
remaining on the plant was allowed to dry naturally. 

The excised ear piece was surface sterilized with 20% 
Clorox for 20 minutes and rinsed 3 times with sterile water. 
Individual embryos were excised and plated scutellum side up on 
as medium [Gamborg] containing 2% sucrose. B5 itamins are 
added to the medium after autoclaving. Four embryos were 
plated per GA7 container and the containers incxifaated in the 
dar3c. When germination occurred, the containers were moved to 
a light culture room and incubated at 25* C using a 16 hour 
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lighr (50 uE .m-2 . s-1) /. 3 hour daric photoperiod. Tiie 
gerainarion frequency is 94%. 

Progeny from IS plants of tiie #171 event and 2 of Ciie 
#176 evenr were rescued using standard embryo rescue teciiniques 
and evaluated. All plants were evaluated by insect assay. 
Plants from tiie #171 event were also tested in tiie 
histochemical GUS assay. In both the insect assay and the GUS 
assay, the ratio of segregation of the transgenes was 1:1, as 
expected for a single locus insertion event* 

EXAMPLE 48. Analysis of transgenic maize plants 
ELI5A AS SAT 

Detection of crylA(b) gene expression in transgenic 
maize is monitored using European com borer (ECB) insect 
bioassays and SLISA analysis for a quantitative determination 
of the level of crylA(b) protein obtained. 
Quantitative determination of crylA(b) I? in the leaves of 
transgenic plants was performed using enzyme-- linked 
immunosorbant assays (SLISA) as disclosed in Clark M ?, Lister 
R Mr 3ar*- Joseph SLISA Techniques. In: Weissbach A, Weissbach 
H (eds) Methods in Snzvmoloav 118:742-766, Academic Press, 
Florida (1986) . Tmmunoaf finity purified polyclonal rabbit and 
goat antibodies specific for the 3. thurinaiensis subsp. 
kurstaki I? were used to determine ng IP per mg soluble protein 
from crude extracts of leaf samples. The sensitivity of the 
double sandwich SLISA is 1-5 ng IP per mg soluble protein using 
SO ug of total protein per SLISA microtiter dish well. 
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Com excracrs were made by grinding leaf tissue in 



gauze lined plasric bags using a hand held ball-bearing 
homogenizer (AGDIAr Slfcarr IN.) in the presenc of exrraction 
buffer (50 mM Na-CO^ pH 9.5, 100 mM HaCl, 0.05% Triton, 0.05% 
Tween, 1 mM 9MSF and 1 uM leupeptin) . Protein deteraination was 
performed using the Bio-Rad (Richmond, CA) protein assay. 



transformants described above were analyzed for the presence of 
crylA(b) protein using ELISA. These plants varied in height 
from € inches to about three feet at the time of analysis . 



Using the above procedure, the primary maize 



Plant 



Bt ng/mg soluble protein 



5/27/9L 



175- 8 

176- 10 
175-11 
171-4A 
171-6 
171-8 
171-9 
171-13 



0 

700 
760 
59 
50 
60 
280 
77 
43 
60 
55 
13 
19 
19 



0 

1585 
2195 



171-14A 



171-14B 

171-15 

171-lSA 

I71-16B 

171-18 

176-30 

171-32 

171-31 

171-30 

71-14 



980 
166 
370 



1160 



#10 leaf 
1 leaf 




plant 171-16 
#9 leaf 
#1 leaf 



40 

120 
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EUROPEAN CORN BORER ASSAY 

1. One ro four 4 ca secrions are cur from an exrended leaf of a 
com plane. 

2. EacH leaf piece is placed on a moisrened filter disc in a SO 
X 9 mm petri dish. 

3. rive neonate European com borer larvae are placed on 
leaf piece. (Making a total of S--20 larvae per plant.) 

4. The petri dishes are incubated at 29.5 •C. 

5. Leaf feeding damage and mortality data are scored at 24, 48, 
and 72 hours. 



EXAMPLE 49. Expression of St endotoxin in progeny of 
transformed maize plants 

The transformed maize plants were fully fertile and 
were crossed with several genotypes of maize. Progeny from 
these crosses were analyzed for their ability to Icill European 
com borer (EC3) in a standard ECS bioassay (described 
immediately above) as well as for the presence of the crylACb) 
protein using ELISA as described above. The ability to kill EC3 
and the production of crylACb) protein correlated. These traits 
segregated to the progeny with a 1:1 ratio , indicating a single 
site of insertion for the active copy of the synthetic gene. 
This 1:1 ratio was true for both the constitutive 
promoter/ synthetic-cry IA(b) plants and the tissue specific 
pr omoter/ synthetic -cry IA(b) plants (data not shown) . 

Fig. 23A i^ a table containing a small subset of the 

15 S 



wo 93/07278 



'/US92/08476 



torai number of progeny analyzed. This cable is represenrariv 
of a niimber of different crosses. 

Insecr assays were done with Diatrea saccharalis and 
Ostrinia nubilalis using leaf material (as described above) of 
transgenic progeny containing a maize optimized CrylA(b) gene. 
The results of these assays are shown in Fig. 23B. They 
demonstrate that the maize opt: mi zed CrylA(b) gene functions in 
transformed maize to provide resistance to Sugarcane borer and 
Ostrinia nubilalis . 

SXAMPLS 50. EXPRESSION OF THE CRYIA(b) GENE IN MAIZE POLLEN 

Progeny of the transformed maize plants containing the 
chimeric pollen promoter/ synthetic crylA(b) gene derived from 
pCIB4431 were grown in the field to maturity. Pollen was 
collected and analyzed for the presence of the crylA(b) protein 
using standard ELISA techniquesd as described elsewhere. High 
levels of cryIA{b) protein were detected in the pollen. 
Progeny from the 35S promoter/ synthetic crylA(b) transformed 
plant were grown in the greenhouse. Pollen from these plants 
was analyzed using ELISA, and crylA(b) protein was detected. 

Results are shorn below in Figure 23C. 

It is recognized that factors including selection of 
plant lines r plant genotypes, synthetic sequences and the like, 
may also affect expression. 

EXAMPLE 51. EXPRESSION OF THE CRYIA{b) GENE FUSED TO A 
PITH-PREFERRED PROMOTER. 

pCIB4433 (Fig. 36) is a plasmid containing the maize 
optimized CrylA(b) gene fused with the pith-pref erred promoter 
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isoiared from maize. Tliis piasmid was cansrrucred using a 
riiree^way ligation consisting of: 

1) pCI34418r cut with 3stEII and 3amHI; 1.8 Kb fragment 

2) pBtinl, cut witii Nsil and 3stEII; 5.9 Kb fragment; 
pBtinl is described elsewhere in tixis application 

3) PGR fragment VT-151 was generated in a PGR reaction 
using standard conditions as described elsewhere in this 
application. 

PCS primers utilized were: 

KE1S0A28; 5' -ATT CGC ATG CAT GTT TCA TTA TC-3' 

KE151A28: 5*^ - GCT GGT ACC ACG GAT CCG TCG CTT CTG TGC AAC AAC 
C-3' 

After the PCS reaction, the DNA was checked on an 
agarose gel to make sure the reaction had proceeded properly. 
DKA was recovered from the PGR reaction using sta n da r d 
conditions described elsewhere and stibsequently cut with the 
restriction enzymes Nsil and BamHI using st and ard condition. 
After cutting/^ the fragment was run on a 2% NuSieve gel and the 
desired band recovered as described elsewhere. The DNA was 
used in the ligation described above. 

After ligation (under standard condition) , the DNA was 
transformed into competent coll call. 

Transformation was carried out using micropro jectile 
bombardment essentially as described elsewhere in this 
application. Embryos were transferred to medium containing 
lOapg/ml PPT 24 hours after micropro jectile bombardment. 
Resulting callus was transferred to medium containing 40 pg/ml 
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PPT after four weeks. Plants were regenerated without 
selection. 

A small sample of plants (3-5) was assayed by ?CR for 
each event- Further codes were added to indicate different 
positions and distances of embryos with respect to the 
micropro jectile bombardment device. Plants were sent to the 
greenhouse having the following codes: 

JS21A TOP Plants 3,t: . PGR Positive 

JS21A MID Plants a>t . PGR Positive 

JS21C 30T Plants B. t . PGR Positive 

JS22D MID Plants 3,t. PGR Positive 

JS23B MID Plants B.t. PGR Negative (for control) 

Leaf samples from the regenerated plants were 
bioassayed for insecticidal activity against Suropean com 
borer as described in Sxample 48 with the results shown in Fig. 
23D. 

ZLISA analysis of leaf samples to quantify the level of 

CrylA(b) protein expressed in the leaves was carried out as 

* 

described in Kxample 48 with the results shown in Fig. 24E. 

Deposits 

The following plasmids have been deposited with the 
Agricultural Research Culture Collection (NRRIi) (1318 N. 
University St., Peoria, XL S1S04) under the provisions of the 
Sudapest Treaty: ?CI34418, pCia4420, pCI34429, pCI34431, 
?CI34423, PCI3S601, pCIB2166 and ?CIB3171. 
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The present: Invencioii has been described with reference 
to specific embodimenrs thereo£; however it will be appreciated 
that nmerous variations^ modi f ications , and embodiments are 
possible. Accordingly, all such variations, modifications and 
eobodiments are to be regarded as being within the spirit and 
scope of the present invention. 
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SEQUENC2 LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Svola, Stephen V. 

Crossland, Lyle 0. 
Wrigbt, Harrha S. 
Merlin, Ellis J. 
Launis, Karen L. 
Rociiscein, Steven J. 

(ii) TITLE OF INVENTION: SYNTHETIC DNA SEIQUENCS HftVING ENHANi 
INSECTICIDAL ACTIVITY IN MAIZE 

(iii) NUMBER OF SEQOENCSS: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: CIBA-GSIGY Corporation 

(B) STREET: 7 Skyline Drive 

(C) CITY: Hawtiome 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10532 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-OOS/MS-OOS 

(D) SOFTMARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: 

(B) REGISTRATION NUMBER: 

(C) REFERENCS/DOCSZT NUMBER: CGC 1577/CIP 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (919)541-8615 

(B) TELEFAX: (919)541-3689 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : si.ngle 

(D) TOPOLOGY: linear 
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(ii) MOLECULS TXPE: DNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE; NO 

(vi) ORIGINAL SODRCS: 

(A) ORGANISM: Bacillus wiiurlngiensls 

(B) STRAIN: Kurstaici HD-1 

iC) INDIVIDUAL ISOLATE: CrylA(b) insecricidal crystal 
prorein gene 



(xi) SBQDENCS DESCRIPTION: SEQ ID N0:1: 



GTTAACACCC 


TGGGTCAAAA 


ATTGATATTT 


AGTAAAAT7A 


GTTGCACTTT 


G7GCATTT7T 


60 


TCATAAGATG AGTCATATGT 


TTTAAATTGT 


AGTAATGAAA 


AACAGTATTA 


TATCATAATG 


120 


AATTGGTATC 


TTAATAAAAG 


AGATG6AGGT 


AACT2ATGGA 


TAACAATCCG 


AACATCAATG 


180 


AATGCATTCC 


TTATAATTGT 


TTAAGTAACC 


CTSAAGTAGA 


AGTATTAGGT 


GGAGAAAGAA 


240 


TAGAAACTGG 


TTACACCCCA 


ATCSATATTT 


CCTTGTCGCT 


AACSCAAT7T 


cTrrrGafiTG 


300 


AATTTGTTCC 


CGGTGCTGGA 


TTTGTGTTAG 


GACrrAGTTGA 


TATAAXATGG 


GGAATTTTTG 


360 


GTCCCTCTCA ATGGSACGCA 


TTTCTTGTAC 


AAATTGAACA 


GTTAHTTAAC 


CAAAGAATAG 


420 


AAGAATTCGC 


TAGGAACCAA GCCATTTCTA 


GAT7AGAAGG 


AC7AAGCAA7 


CTTTATCAAA 


480 


TTTACGCAGA ATCTTTTAGA 


GAGTGGGAAG 


CAGATCCTAC 


7AA7CCAGCA 


T7AAGAGAAG 


540 


AGATGCGXAT 


TCAATTCAAT 


GACATGAACA 


G7GCCC77AC 


AACCGCTATT 


CCTCTTTTTG 


600 


CAGTTCAAAA 


TTATCAA6TT 


CCTCTTTTAT 


CAGTATAT6T 


TCAAGC7GCA 


AATTTACAT7 


660 


TATCAGTTTT 


GAGAGATGTT 


TCAGTGTTTG 


GACAAASS7G 


GGGAT77GA7 


GCCGCGACTA 


720 


TCAATAGTCG 


TTATAATGAT 


TTAACTAGGC 


TTATTGGCAA 


C7A7ACAGAT 


CA7GC7GTAC 


780 


GCTGGTACAA 


TACGGGATTA 


GAGCGTGTAT 


GGGGACCSGA 


TTC7AGAGAT 


7GGATAAGAT 


840 


ATAATCAATT 


TAGAAGAGAA TTAACACTAA 


CTG7ATTAGA 


TATCGTTTCT 


CTATTTCCGA 


900 


ACTATGAIAG 


TAGAACGTAT 


CCAATTCGAA 


CA6TTTCCCA 


ATTAACAAGA 


GAAATTTATA 


960: 

* 


CAAACCCAGT 


ATTAGAAAAT 


TTTGATGGTA 


GTTTTCJ3AGG 


CTCGGCTCAG 


GGCATAGAAG 


1020 


GAAGTATTAG 


GAGTCCACAT 


TTGATGGATA 


TACTTAACAG 


TATAACCarC 


TA7ACGGATG 


1080" 


CTCATAGAGG 


AGAATATTAT 


T GGT CAGGGC 


ATCAAATAAT 


GGCTTCTCCT 


GTAGGGTTTT 


1140 
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ATTCACTTTT 


CCSC7A7A7G 


GTATTGTTGC 


TCAACTAGGT 


CAGGGCG7G7 


GACCTTTTAA 


TATAGGGATA 


AA7AA7CAAC 


CTTATGGAAC 


CTCCTCAAAT 


7TGCCA7CCG 


CGC7GGATGA 


AATACCGCCA 


CAGmA7AACA 


GATTAAGCCA 


TGTTTCAATG 


777CG77CAG 


GAGCTCCTAT 


G7TC7CTTGG 


ATACA7CG7A 


CACAAATTAC 


ACAAATACCT 


77AACAAAA7 


TTAAAGGAC2 


AGGAT7TACA 


GGAGGAGA7A 


CAACCTTAAG 


AGTAAATAT7 


AC7GC:ACCA7 


ACGCTTCTAC 


CACAAATTTA 


CaA77CCA7A 


GGAATTTTTC 


AGCAAC7ATG 


AG7AG7GSGA 


TAGGTTTTAC 


TACT CCGTT7 


AAC7777CAA 


ATGTCTTCAA 


77CAGGCAAT 


GAAG77TA7A 


TAACCTTTGA 


GGCAGAATAT 


GAT77AGAAA 


CTTCTTCCAA 


TCAAA7CGGG 


77AAAAACAG 


CCAATTTAGT 


TGAGTGTTTA 


7C7GA7GAA7 


AGAAAG7CAA 


ACATGCGAAG 


CSAC77A67G 


TTAGAGGGAT 


CAATAGACAA 


C7AGACCS7G 


AAGGAGGCGA 


TGACGTAT7C 


AAAGAGAA77 


GCTATCCAAC 


GTATTTATAT 


CAAAAAA7AG 


ACCAATTAAG 


AGG57ATATC 


GAAGA7AG7C 


ATGCCAAACA 


CGAAACA67A 


AA7G7SCCAG 


CAAGTCCAAT 


CGGAAAA7G7 


GCCCA7CA77 


GATG7ACAGA 


C77AAA7GAG 


GACTTAGGT G 


ATGGCCATGC 


AAGAC7AGGA 


AA7C7AGAA7 


CACTAGCTZG 


TG7GAAAAGA 


GCGGAGAAAA 
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GAAC7A7GGG 


AAA7GCAGC7 


CCACAACAAC 


1200 


A7AGAAC:A7T 


A7CG7CZAC7 


77A7A7AGAA 


1260 


AAC7A7C7G7 


7CTTSACGSG 


AC:AGAA777G 


1320 


C7G7A7ACAG 


AAAAAGCGGA 


ACGG7AGAT7 


1380 




7AGGCAAGGA 


TT7AG7C:ATC 


1440 


GC777AG7AA 


TAG7AG7G7A 


AG7A7AA7AA 


1500 


G7GC7GAA77 


TAA7AA7A7A 


A77CC77CA7 


1560 


C7AC7AA7C7 


TGGC7C7GGA 


AC77C7G7CG 


1620 


77C77CGaAG 


AAC77C:ACC7 


GGCCAGATTT 


1680 


7A7CACAAAG 


A7A7CGGG7A 


AGAA77CGC7 


1740 


CA7CaA77GA 


CGGAAGACC7 


A77AA7CAGG 


1800 


G7AA777ACA 


GTCCGGAAGC 


777AGGAC7G 


1860 


A7GGA7CAAG 


TG7A77TACG 


77AAG7GC7C 


1920 


7AGA7CGAA7 


7GaA777G77 


CwGGCAGAAG 


1980 


GAGCACAAAA 


GGCGG7GAA7 


GAGC7G777A 


2040 


A7G7GACGGA 


77A7CATA77 


GA7CAAGTA7 


2100 


T77G7C7GGA 


7GAAAAAAAA 


GAA77G7CC:G 


2160 


A7GAGCGGAA 


T77AC77CAA 


GA7CCAAAC7 


2220 


GC7GGAGAGG 


AAG7ACGGA7 


A77AC!::A7CC 


2280 


AC GTTAC G C T 


A77GGG7ACC 


777GA7GAG7 


2340 


A7GAG7CGAA 


A77AAAAGCC 


7A7ACCCG77 


2400 


AAGAC77AGA 


AA7C7A777A 


A77CGC7ACA 


2460 


G7ACGGG77C 




C777C:AGCCC 


2520 


C ^ CA^'CA*^*" *" 




A77GA7G7TG 


2580 




ATTCAAGATT 


aagacgcaag 


2640 


77C7CGAAGA 


GAAACCA77A 


GTAGGAGAAG 


2700 


AA7GGAGAGA 


CAAACG7GAA 


AAATTGGAAT 


2760 


153 
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TATTGTTTAX 


AAAGAGGCAA AAGAATCTGT 


AGATGCTTTA 


777G7AAAC7 


2820 




TAGATTACAA 


GCGGA7ACCA 


ACA7CGCGA7 


GATTCATGCG 


GCAGA7AAAC 


2880 


GCSTTCATAG 


CATTCGAGAA 


GCTTATCTGC 


C7GAGC7G7C 


TGTGAT7CCG 


GG7G7C:AA7G 


2940 


CGGCTATTTT 


TGAAGAATTA 


GAAGGGCGTA 


7777CAC7GC 


ATTCTCCCTA 


7A7GA7GCSA 


3000 


GaAATGTCAT 


TAAAAA7GGT 


GATT7TAATA 


A7GGC77A7C 


C7 GC7GSAAC 


G7GAAAGGGC 


3060 


ATGTAGATGT 


AGAAGAACAA 


AACAACCACC 


^ vvv ^1 ^^nt 


TGTTGTTCCG 


GAA7GGGAAG 


3120 


CAGaAGTGTC 


ACAAGAAGTT 


CGTGTCTGTC 




CTATATCCTT 


C37G7CaCAG 


3180 


CGT&CAAGS^ 


GGGATAT6GA 


GAAGGTTGCG 


7AACCA77CA 


TGAGATCGAG 


AACAA7ACAG 


3240 




GTTTAGCAAC 


TGTGTAGAAG 


AGGAAG7A7A 


TCCAAACAAC 


ACGG7AACG7 


3300 


GTAATGATTA 


TACTGCGACT 


CAAGAAGAAT 


A7GAGGG7AC 


GTACACTTCT 


CG7AA7CGAG 


3360 


GATATGACGG 


AGCCTATCaA 


AGCAATTCTT 


C7G7ACCAGC 


TGATTATGCA 


7C:AGCC7A7G 


3420 


AAGAAAAAGC 


ATATACAGar 


GGACGAAGAG 


ACAA7CC77G 


7GAATC7AAC 


AGAGSA7A7G 


3480 


GSGATTACAC 


AOdAC^ACdA 


GC7GGCTAT6 


7GACAAAAGA 


AT7AGAGTAC 


7?ccc:agaaa 


3540 


CCGATAAGGT 


ATGGAT7GAG 


ATCSGAGAAA 


CGSAAGSAAC 


ATTCATCGTG 


GACA6CG7GG 


3.600 

r 


AAT^ACTTCT 




TAATATATGC 


777A7AA7G7 


AAGS76TGCA AATAAAGAAT 


3660 


GATTACTGAC 


TTGTATTGAC 


AGA7AAATAA 


GGAAA77777 


ATA7GAATAA 


AAAACGGSCA 


3720 


TCACTCTTAA AAGAATGATG 


^^KdA^^B^Bto 


G7A7GA777A 


ACGAGTGATA 


777AAA7G77 


3780 




AGG CTTTACT 


TAACGGGG7A 


CCGCCACA7G 


CCCATCAACT 


7AAGaA777G 


3840 




AAG7GTCAAA 


AAACS77AT7 


C777C7AAAA 


AGC7AGCTAG 


AAAGGA7GAC 


3900 


ATTTTTTATG. 


AATCTTTCAA 


TTCAAGA7GA 


A7TACAACTA 


7TTTCTGAAG 


AGC7G7A7CG 


3960 


TCATTTAACC 


CCTTCTCTTT 


7GGAAGAAC7 


CSCTAAAGAA 


TTAGGTTTTG 


7AAAAAGAAA 


4020 


ACGAAAGTTT 


TCAGSAAATG 


AA7TAGC7AC 


CarXTGTATC 




AACG7ACAGC 


4080 


GAGTGATTCT 


CTC3TTCSAC 


7A7GCAG7c::a 


ATTACACGCC 


GCCACAGCAC 


7C77A7GAG7 


4140 


CCAGAAGGAC 


7CAATAAACG 


C777GA7AAA 


AAAGCGG77G 


AA77777GAA 




420g 


TCTGCATTAT 


GGAAAAGTAA 


AC777G7AAA ACA7CAGCCA 


777CAAG7GC 


AGCAC7CACG 


4260 


lATTTTCAAC 


GAATCCG7AT 


777AGA7GCG 


ACGATTTTCC 


AAG7ACCGAA 


ACATT7AGCA 


432T} 


CATGTATATC 


CTGGGTCAGG 


7GG77G7GCA 


CAAACTGCAG 






4360 
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a 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQDENCS CHARACTERISTICS: 

(A) LENGTH: 3474 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SODRCS:' 

(A) ORGANISM: Pure maize optimized synrhecic 3T CrylA(b) 



(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 2 



ATGGACAACA 


ACCCCAACAT 


CAACGAGTGC 


AT C C CC7A.CA 


ACTGCCTGAG 


CAACCCCSAG 


60 


GTGGAGGTGC 


• ^» ^^^PW • 


GCGCATCSAG 


ACCGGCTACA 


CCCCCATCGA 


CATCAGCCTG 


120 


AGCCTGACCC 


AG7TCCTGC7 


GAGCGAGTTC 


GTGCCCSSCG 


CCGGCTTCGT 


GCTGGGCCTG 


180 


GTGGACATCA 


TC7GGGGCAT 


CT7CGGCCCC 


AGCCAGTGGG 


ACGCCTTCCT 


GGTGCAGATC 


240 


GAGCAGCTGA 


TCAACZAGCG 


« 

CATCSAGGAG 


TTCGCCCGCA 


ACCAGGCCAT 


CAGCCGCwTG 


300 




GCaACCrTGTA 


CCAGA7CTAC 


GCCGAGAGCT 


TCCSCSA6TG 


GSAGGCCSAC 


360 




CCGCCCTGCG 


CSAGGAGATG 


CGCATCCAGT 


TCAACGACAT 


GAACAGCGCC 


420 


CTGAC2ACCG 


CwiA7C^wCC7 


GTTCGCCGTG 


CAGAAC7ACC 


AGGTGCCCCT 


6CTGA6CSTG 


480 


TAC ST GCAGG 


CCGwCAACw7 


GCACCTGAGC 


G7GC7GCGCG 


ACG7GAGCGT 


GTTCGGCCAG 


540 




TCSACSCwGC 


CACCATCAAC 


AGCCGC7ACA 


ACGACCTGAC 


CCGCCTGATC 


600 


GGCAACTACA 


C Z GACCACGC 


CG7GCGC7GG 


TACAACACCG 


GCCTGGAGCG 


CSTGTGGGGC 


660 


CCCGACAGCC 


GCSACTGSAT 


CCSC7ACAAC 


CAGTTCCGCC 


GCGAGCTGAC 


CCTGACC3TG 


720 


CTGGACATCG 




CC2CAACTAC 


GACAGC CGCA 


CCTACCCCAT 


CCGCACCS7G 


780 


AGCCAGCTGA 


CCCGCSAGAT 


C7ACACCAAC 


CCCGTGCTGG 


AGAACTTCGA 


CGGCAGC77C 


840 




CCCAGGGCAT 




ATCCGCAGCC 


CCCACCTGAT 


GGACATCCTG 


900 


AACAGCATCA 


CCATCTACAC 


CGACGCCCAC 


CGCGGCGAGT 


ACTACTGGAG 


C GGC CAC ClAG 


960 
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GCwCCGTGGG 


CTTCAGCGGC 


CG2GAGTTCA 




G7ACGGCACC 








GCAGCGCATC 




7GGGCCAGGG 


CG7GTACCGC 


1080 


ACZCTGAGCA 


GCACCCTG7A 


CCGCC3CCCC 


TTCAACATCG 


GCATCAACAA 


CCAGCAGCTG 


1140 


AGCGTGCTGG 




GTTCGCCTAC 


GGCACCAGCA 


GCAACGTGCC 


CAGCGCCGTG 


1200 


■TACCGCAAGA 


GCGGCACCGT 


GGACAGCC7G 


GACGAGATCC 


CCCCCCAGAA 


CAACAACGTG 


1260, 


CCCCZZCGCC 


AGGGCTTCAG 


CCACCSCCTG 


AGCCACS7GA 


GCATGTTCCG 


CAGCGGCTTC 


1320 


AGCAACAGCA 


GCSTGAGCXT 


CATCCGC3CC 


CCCATGTTCA 


GCTG3ATCCA 


CCSCA6CSCC 


1380 


GAGTTCaACA 


ACATCATCCC 


CAGCAGCCAG 


ATCACCCAGA 


TCCCCCTGAC 


CAAGAGCACC 


1440 . 


AACCTGGGCA 


GCGGCACCAG 


CGT6GTGAAG 




TCACCGGCGG 


CGACATCCTG 


loOO 


CGCCGw^CA 




GATCAGCACC 


CTGCGCGTGA 


ACATCaCCGC 


CC CCC7GAGC 


1560 


CAGCGCTACC 


GCGTGCGCAT 


CCGC7ACGCC 


AGCACCACCA 


ACCTGCAGTT 


CCACACCAGC 


1S20 


ATCSaCGGCC 


GCCCCATCXA 


CCAGGGCAAC 




CCATGAGCAG 


CGGCAGCAAC 


ISBO 


CTGCAGAGCG 


GCAGCTTCCS 


CACCGTGGGC 


T7CACCACCC 


CCTTCaaCTT 


CAGCAACSGC 


1740 


AGCAGCSTG7 


TCACCC7GAG 


CGCCCAC67G 


T7CAACVGCS 


GCAACSAG&T 


GXIUZATCGAC 


1800 


CGCATCGAGT 


TCGTGCCCGC 


CGAGGTGACC 


TTCG&GviCwG 


AG7AC6ACCT 


GGA6CGCSCC 


1860 


CAGAAGGCCG 


TGAACSAGCT 


GTTCACCA6C 


AGCAACCAGA 


TCGGCCTGAA 


GACCGACGTG 


1920 


ACC3ACTACC 


ACATCSACCA 


GS7GAGCAAC 


CTGGTGGaGT 


GCCTGAGCSA 


CGAGTTCTGC 


1980 


CTGGACGAGA 


AQAAGGAGCT 


GAGCSAGAAG 


GTGAAGCACG 


CCAAGCGCCT 


GAGCGACGAG 


2040 


CSCAACC7GC 


TGCAGGACCC 


CAACTTCCGC 


GGCATCAACC 


GCCAGCTG&& 


CCGCG6CTGG 


2100 


C GCGGCIAGCA 


CCSACATCAC 


CATCCAGGGC 


G5CGACGACG 


TGTTCAAGGA 


GAACTACGTG 


2160 


ACCCTGCTGG 


GCACC?TCSA 


CSAG7GCTAC 


0 w GiAG w7AC w 


TGTACCAGAA 


GATCGACGAG 


2220 


AGCAAGCTSA 


AGSCCTACAC 


CCSCTACCAG 




ACATCGAGGA 


CAGCCAGGAC 


M 

2280 


CTGGAGATCT 


ACCTGATCCG 


CTACU^CGCC 


AAGCACOAGA 


CCGTGaACGT. 




2340 


GGCAGCCTGT 




daw ws« W h-wAU w 


CCCATCGGCA 


AGTGCGCCCA 


CCACAGCCAC 


240a 

<* 


CACTTCAGCC 


TGGACATCGA 


CSTG5GCTGC 


ACCGACC7GA 


ACGAGGACCT 


GGGCGTGTGG 


2460 


GTGATCTTCA 


AGATCAAGAC 


CCAGGACGGC 




TGGGCAACCT 


GGAGTTCCTG 


2520 


GAGGAGAAGC 


CCC7GGTGGG 


CSAGGCCwTG 


GCCCGCGTGA 
166 




GAAGAAGTGG 


2580 
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CGCGACAAGC GCGAGAAGCT GGAGTGGGAG ACCAACATCG TGTACAAGGA GGCCAAGGAG 2640 

AGCGTGGACG C3CTGTTCGT GAACAGCCAG TACGACCGCC TGCAGGCC3A GACCaACATC 2700 

GCZATGATCC ACGCCGCCGA GAAGCGCGTG CACAGCA.TCC GCGAGGCCTA CCTGCCCGAG 27 SO 

CTGAGCGTGA TCCCC3GCGT GAAC3CC3CC ATCTTCGAGG AGCTGGAGGG CCGCATCTTC 2820 

ACwGCCTTCA GCCrGTACGA CGCCCGCAAC GTGATCAAGA ACGGC3ACTT CAACAAC3GC 2880 

CTGAGCTGCT GGAACGTGAA GGGCCACGTG GACGTGGAGG AGCAGAACAA CCACCGCAGC 2940 

GTGCTGGTGG TGCCCGAGTG GGAGGCCGAG GTGAGCCAGG AGGTGCGCST GTGCCCCGGC 3000 

CGCGGCTACA TCC7GCGCGT QACCGCCTAC AAGGAGGGCT ACGGCGAGGG CTGCGTGACC 3060 

ATCCACGAGA TCGAGAACAA CACCGACGAG CTGAAGTTCA GCAACTGCGT GGAGGAGGAG 3120 

G7GTACCCZA ACAACACCG7 GACCTGCAAC GAC7ACACCG CCACCCAGGA GGAG7ACGAG 3180 

GGCACZ7ACA CCAGCCGCAA CCGCGGC7AC GACGGCGCC7 ACGAGAGCAA CAGCaGCG7G 3240 

CCCGCCGAC7 ACGCCAGCGC C7ACGAGGAG AAGGCC7ACA CCGACGGCCG CCGCGACAAC 3300 

CCC7GCGAGA GCAACCGCGG C7ACGGCGAC 7ACACCCCCC 7GCCCGCCGG C7ACG7GaCC 3360 

AAGGAGC7GG AGTAC77CCC CGAGACCGAC AAGG7G7GGA 7CGAGA7CGG CGAGACCGAG 3420 

GGCACZ77CA 7CG7GGACAG CG7GGAGC7G C7GC7GATGG AGGaG7AG7A CA7G 3474 
(2) INF0RMA7I0N TOR SZQ ID NO: 3: 

(i) SEQUENCE CHaRAC7ERIS7ICS : 

(A) LZNG7H: 1961 base pairs 

(B) 7YPE: nucleic acid 

(C) S7RANDEDNESS : single 
<D) 70P0L0GY: linear 



(ii) MOLECULE 7YPE: DHA 
(iii) HYP07HE7ICAL: NO 
(iv) AN7I-3ENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Truncarsd synrhenic maise apcimized 37 
CrylA(b) gene 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
^ GA7CC:aACAA TGGACAACAA CCCCAACA7C AACGAG7GCA 7CCCC7ACAA C7GCC7GAGC 50 
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AACCCCGAGG 


TGGAGGTGCT 


GGGCGGCGAS 


CGCAICGAGA 


C C G GC7ACAC 


w^<«»wvx wo<iW 


120 


ATCAGCCTGA 


GCCTGACCCA 


GTTCCTGCTG 


AGCGAGT? C G 


XOWwwOOWOW 


woow* «woXo 


ISO 


CSGGGCCTGG 


TGGACATCAT 


C7GGGGCATC 


• XWOOWww<M-A 


GwGAGTGGGA 


CGCw77Cw7G 


240 


GTGCaGATCG AGCAGCTGAT 


CAACCAGCGC 


A7CSA6GAG7 


X wowwWowAa 


CwAGGCCA7C 


300 






CAACCTGTAC 


CAAATC7ACS 


CGGAGAGCTT 


w^owoAo X (io 


360. 

•J 






CGCCCTGCGC 


GAGGAGA7GC 


GCATCCAGTT 


CAACGACA7G 


420 






CATCCCCCTG 


77CGCCG7GC 


AGAAC7ACCA 


GG7GCCCw7G 


480 






CGCCAACCTG 


CACCTGA6CG 


TGCTGCGCGA 


WO X wAiawo X o 


o40 


TTCGuwwAGw 




CGACGCCGCC 


ACCA7CAACA 


GCCGC7ACAA 


CGACC7GACC 


600 




o winU X ALiAjU 


cs&cc&cscc 


^* ^* *^ ^••W 

OX OWOW X OO X 


ACAACACCGG 


CC7GrGAGCGC 


660 






CS&C7GGA7C 


AGG7ACAACC 


AGTTCCGCCG 


CGAGC7GACC 


j% 

720 






GaGCCTGTTC 


CCCAAC7ACG 


ACAGCwGCAC 


C7ACCCCA7C 


780 


UiswiJU wLa X van 




CCGCGAGAT7 


7ACACZA&CC 


CCo7GC7GGA 


GAAC77CGAC 


340 




Uso Lao wiowow 


CCAGSGC&TC 


G&GGGCAGCA 


TCCowAGCww 


CCACC7GATG 


900 








GACGCCCACC 


GCGGCGAG7A 


C7AC7GGAGC 


960 




^ VmhAX 00\^ waO 


CCCCSTC3GC 


. 1 U Ala woo w w 


CwGAGTTCAC 


W^ ^WwWwM^O 


1020 






TGCACCTCAG 


CAGCGCX7CG 


TGGCACAGCT 


GGGwCAGGGA 


1080 






CACCCTGTAC 


CGxCGACCxT 


TCAACA7CGG 


ca7CAAc:aac 


1140 


VaAtJ^ AO W X OXl 


OWO X O W A OOA 


CGGCACCGAG 


TTCGCCTACQ 


GCACCAGCAG 


CaACC7GCCC 


1200 




ACCSCAAGAG 


CGGCACwGTG 


GACAGCCTGG 


ACGAGA.7CCC 


CCw7CAGAAC 


1260 


AACAACGTGC 


CACCTCSaCA 


GGGC77CAGC 


CACCGTCTGA 


GCCACG7GAG 


CA7G77CCGC 


1320 


AGTGGCTTCA 


GCAACAGCAG 


CSTGAGCXTC 


ATCCGTGCAC 


C7A7G77CAG 


C7GGA77CAC 


1380 


CGCAGTGCCG 


AGTTCAACAA 


CATCATCCCC 


AGCAGwCAGA 


TCACCCAGA7 


CCCCC7GACC 


1440 


AAGAGCACCA 


ACCTGGGCAG 


CGGCAC CAGC 


GTGGTGAAGG 


OWww«^OOWa X 


UAWwOOWOOW 


1500 

• 


GACA7CCTGC 


GCCGCACCAG 


WwwwOOWwio 


ATCAGCACCC 


7GCGCG7GAA 


CA7CACCGwC 


1560 




AGCGCxACwG 


CGTCCGCATC 


wow aAwoWwa 


GCACCACCAA 


CC7GCAG77C 


1626 


CaCACZAGCA 


TCGACGGCCG 


CCCCATCAAC 


CAGGGCAACT 


TCAGwGwCAC 


CA7GAGC:;^C 


1680 
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GGCAGCAACC TGCAGAGCGG CAGCTTCCGC ACCGTGGGCT TCACCACCCC CTTCAACTTC 1740 

AGCAACGGCA GCAGCGTGTT CACCCTGAGC GCCSACGTGT TCAACAGCGG CAACGAGG7G 1800 

TACATCGACC GCATCGAGT? CGTGCGGGCG GAGGTGACCT TCGAGGCCGA GTACGACCTG 1860 

GAGAGGGCTC AGAAGSCCST GAACSAGC7S 77CACCAGCA GCAXCCAGAT CSGCC7GAAG 1920 

- ACCGAC37GA CCGACTACCA CATCGATCAG GTGTAGGAGC T 1961 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3508 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEONESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Full lengrii synriietic maize opuioized 3T 
CrylA(b) gene 



Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



GATCCAACAA 


TGGACAACAA 


CCCCAACATC 


AACGAGTGCA 


TCCCCTACAA 


CTGCC7G1AGC 


60 


AACCCCGAGG 


TGGAGGTGCT 


GGGCGGCSAG 


CGCATCSAGA 


CCGGCTACAC 


CCCCATCGAC 


120 


ATCAGCCTGA 


GCCTGACCCA 


GT7CC7GCTG 


AGCGAGTTCG 


TGCCCGGCGC 


CGGC7TCGTG 


180 


C7GGGCCTGG 


TGSACATCAT 


CTGGSGCATC 


TTCGGCCCCA 


GCCAGTGGGA 


CGCCTTCC7G 


240 


GTGCAGATCG 


AGCAGCTGAT 


CAACCAGCGC 


ATCGAGGAGT 


TCGCCCGCAA 


CCAGGCCATC 


300 


AGCwGCCTGG 


AGGGCC7GAG 


CAACCTGTAC 


CAAATC7ACG 


CCGAGAGCTT 


CCGCGASTGG 


360 


GAGGCCIGACC 




CGCCC7GCGC 


GAGGAGA7GC 


GCATCCAGTT 


CAACGACATG 


420 


AACAGCGCGC 


TGACGACCGC 


CATCCCCCTG 


TTCGCC3TGC 


AGAAC7ACCA 




480 




ACGTGCAGGG 


CGCCAACC7G 


CACCTSAGCG 


TGCTGCGCGA 


CG7CAGGG7G 


540 


TTCGGCCAGC 


GCTGGGGCTT 


CGACGCCGCC 


ACCATCAACA 


GCCGCTACAA 


CGACCTGACC 


600 


CGCCTGATCG 


GCAACTACAC 


CGACdAiCGGG 


GTGCGCTGGT 


ACAACACCGG 


CCTGGAGCGC 


660 


GTGTGGGGTC 


CwGACAGCCG 


CGACTGGATC 


AGGTACAACC 


AGTTCCGCCG 


CGAGCTGACC 


720 
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T G GACATCGT 


GAGCCTGTTC 




ACAGC C GCAC 




780 




GCwAGCTGAC 


CCGCGAGATT 


T ACAC mAAC C 


X W3\« • UVATl 


GAACTTCGAC 


840 




GCGGCAGCGC 


CCAGGGCATC 


GAGGGCAGCA 




CCACCTGATG 


900 




ACAGCATCAC 


CATCTACACC 


GACGCCCACC 


GCGGCGAGTA 


CTACTGGAGC 


960*" 




TCATGGCCAG 


CCCCGTCGGC 


T7C&GCGGCC 


CCGaG77CAC 


CTTCCCCCTG 


1020, 


TACGGCaCCA 


TGGGCAACGC 


TGCACCTCAG 


CAGCGCATCG 


7GGCAC:AGC7 




1080 


GTGTACCGC:i 


CCCTGAGCAG 


CACCCTGTAC 


CGTCGaCCTT 


TCaACA7CGG 


CA7CAACAAC 


1140 


CAGCAGCTGA 


GCGTGCTGGA 


CGGCACCSAG 


TTCGCCTACG 


GCACCAGCA6 


CAACCTGCCC 


1200 


AGCGCCGTGT 


ACCGCAAGAG 


CGGCACCGTG 


GACAGCC7GG 


ACSAGA7CCC 


CCC7CASAAC 


1260 


AACAACGTGC 


CACCTCGACA 


GGGCTTCAGC 


CaCCGTCTGA 


SCCACG7GAG 


Ca7G77CCGC 


1320 


AGTGGCTTCA 


GCAACAGCAG 


CGTGAGCATC 


ATCCGTGCaC 


C7A7G77CaG 


C7GGA77CAC 


1380 


CGCaGTGCCG 


AGTTCAACAA 


CATCATCCCC 




7CACCCAiGA7 


CSCCC7GACC 


1440 


AAGAGCACCA 


ACCTGGGCAG 


CGGCACCAGC 


GTGG7GAAGG 


GCCCCGGC77 


CACCSGCSGC 


1500 


GACATCCTGC 


GCCGCACCA6 




A7CA6C:ACCC 


7GCSCG7GAA 


CA7CACCSCC 


1560 


CCCCTGAGCw 


AGCSCTACZS 


CGTCCGcarc 


CSC7ACGCCA 


GCACCACCAA 


CC7GCAG7TC 


1620 






CCCCATCaAC 


CAGSGCAAC7 


7CAGCGCCAC 


CA7GA6CAGC 


1680 


GGCAGCAACC 


TSCAGAGCSG 


CAGcrrccsc 


ACC37GSGC7 


7CACCaCCCC 


C77CaAC77C 


1740 


AGCAACGGCA 


GCAGCSTGT? 


CACCC7GAGC 


GCi:CaC37G7 


7CAACAGC3G 


CAACSAGG7G 


1800 


TACATCGACC 


GCATCGAGTT 


CSTGCCCGCC 


GAGG7GACC7 


TCGAGGCCGA 


GTACGACC7G 


1860 


GAGAGGGCTC 


AGAA6GCCGT 


GAACSAGCTG 


77CACZAGCA 


GCAACCAGA7 


CGGCC7GAAG 


1920 


ACCSACGTGA 


CCGACTACCA 


CASCGATCAG 


G7GA6CAACC 


7GG7GGAG7G 


CC7GAGCSAC 


1980 


GaGTTCTGCC 


TGGACG&GSA 


GAAGGAGCTG 


AGCSAGAAGG 


TGAAGCACGC 


CAAGCGCC7G 


2040 


AGCGACSAGC 


GCAACC7GCT 


GCA6GACCCC 


AAC77CCGCG 


GCA7CAACCG 


C2AGC7GGAC 


2100 




G C GGCAGCAC 




AT CCAGGGC G 


GC GAC GAC GT 


G77CAAGSAG 


2160 

* 


AAC7ACSTSA 


CCZTGCTGGG 


CACC7TCGAC 


GAG7GC7ACC 


CZACCTACCr 


g7acc:agaag 


2220 




GCAAGC7GAA 


GGCw'JACACC 


CGCTACCAGC 




CA7CSAGGAC 


228^ 


AGCCAGGAC C 


TGGAGATCTA 


CCTGATCCGC 


rACAACGCCA 


AGCACGAGAC 


CG7GAACS7G 


2340 
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W w ik« W^U w o 


/^«^ /"^ »^ w< m f* 








^« w 


2400 








X W\3V9V^ ^ W 


C2GACCTGAA 




2460 






GATCAAGACC 




S# A 


\^ MB ^ w w 


2520 


i T '*'r* 




WW X WW ^ w WW W 




C!22SCGTGAA 

W wwwww X 


^7 M «0 


2580 






CGAGAAGCTG 




•^^AACATCGtr 


G7ACAAGGAG 


2640 




mm ^# ^tfiA m^ 


CCTG7TCGTG 


AACAGC2AGT 




^^^^^^ M 

Lai— Auruw ^ ^ ^*4i - 


2700 


ACCAACATCG 


CCATGATCCA 


^^"^ 




ACAGCATTCG 


W^aAwVSWw ^ AW 


2760 


\^ ^ «^ \^ «^ ^rfA 


•^GAGCGrGAT 


%^ A «v 


* m^ %^ 


'^CTTCGAGGA 


LSW X Ui3Ai3UOW 


2820 






CCTGTACGAC 

Ww A w Arfaiw wf*^^ 


GCCCGCAACG 

WWwWw W*«f%Ww 


rsATCAAGAA 


I ^<Xi - ^ _ [ . 


2880 


lACAACGGCC 


m ^ ^•f* ^ «m 


GAACGTGAAG 

wvadr*v#s3 X wn**%0 


/^^^^^ <^>^»fl^^ 


ACSTGGAGGA 

AW w X wwf^wUA 


GCAGAACAAC 


2940 




* WW « WW A WW X 


wwwwwtnw X WW 


VJXMTw W w wCnW w 


TGAGCrAGGA 


X W^^WWW X W 


3000 

w www 










AwwAwwwW 




3060 

W w w w 


TGcaTGACCA 




CGAGAACAAC 


ACJ^SACSAGC 


'^CAAGT''*CAG 

A WAAw X w WA w 


^AACTGGGTG 

W^liAW X w W W X w 


3120 


^A.C^aA.G^sA.CG 


TGTACCCCAA 


CAACACCSTG 

WAAlii»flW*i»w X W 


ACOTCrAACG 

AW w X O WAAw w 


ACTACAC^^GG 

AW X AWAW w w W 


CACGC!AGGAG 

WAiW W Wi<XW«Xf1kw 


3180 


GAGTACGAGG 




CAGCCGCAAC 


W W W w\SI ^« w 


ACGGGGG^!TA 

AWwwWwWw X** 


CGAGAGCAAC 


3240 


* ^ 




>^ WiCa>^0>« %A N» 


'^ACGAGGAGA 


AGGC^AGAC 

•MSwWw »AWrf%W 


^W*XWWwW WWW 


•rf ^ w w 


CGCGACAACC 




CAACCGCGGC 


m» ^ ^ 

• AWo\9wonw A 


ACACwWwww* 


VjWwwOWWOWW 


3360 




AGGAGC7 GGA 


GTACTTCCCC 


GAGACGGACA 


AGGTGTGGAT 


CGAGATCGGw 


3420 


GAGACCGAGG 


GCACCTTCAT 


CGTGGACAGC 


O 1 V9\3Ai3W X U W 


TGCTGATG3A 


GGAGTAGTAC 


3480 


ATG7GATAGT 


ACGTAAGCTC 










3508 



(2) INFORMATION FOR SEQ ID N0:5: 

(i) SSQUENCZ CHARACTERISTICS: 

(A) LENGTH: 1345 base pairs 

(B) TYPE: nucleic acid 

(C) 3TRANDEDNES5 : single 

(D) TOPOLOGY: linear 

(ii) MGLZCULZ TYPE: DNA 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Truncared svnriieric 3T aene (Perlak er ai.) 
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(xi) SEQUENCS DESCRIPTION: SEQ ID N0:5: 

ATGGACAACA ACCCAAACAT CAACGAATGC ATTCCATACA ACTGCTTGAG TAACCCAGAA SO ; 

GTTGAAGTAC TTGGTGGAGA ACGCATTGAA ACCGGTTACA CTCGCATCGA CATCTCCTTG 120 

TCCrTGACAC AGTTTCrGCT CAGCGAGTTC GTGCCAGGTG CTGGGTTCGT TCTCGGACTA 180 ' 

GTTGACATCA TCTGGGGTAT CTTTGGTCCA TCTCAATGGG ATGCATTCCT GGTGCAAATT 240 

GAGCAGT?GA TCAACCAGAG GATCGAAGAG TTCGCCAGGA ACCAGGCCAT CTCTAGGTTG 300 

m 

GAAGGATTGA GCAATCTCTA CCAAATCTAT GCAGAGAGCT TCAGAGAGrG GGAAGCCGAT 360 

CCTACTAACC CAGCTCTCCG CGAGGAAATG CGTATTCAAT TCAACGACAT GAACAGCGCC 420 

TTGACCACAG CTATCCCATT GTTCGCAGTC CAGAACTACC AAGTTCCTCT CTTGTCCGTG 480 

TACGTTCAAG CAGCTAATCT TCACCTCAGC GTGCTTC3AG ACGTTAGCGT G7TTGGGCAA 540 

AGGTGGGGAT TCGATGCTGC AACCATCAAT AGCCGTTACA ACGACCTTAC TAGGCTGATT SOO 

GGAAACTACa. CCGACCaCGC TGTTCGTTGG TACAACACTG GCTTGGAGCG TGTCTGGGGT 660 

CCTGATTCTA GAGATTGGAT TAGATACAAC CAGTTCAGGA GAGAATTGAC CCTCACAGT? 720 

TTGGACATTG TGTCrCTCTT CCCGAACTAT GACTCCAGAA CCTACCCTAT CCGTACAGTG 780 

TCCCAAC7TA CCAGAGAAAT CTATACTAAC CCAGTTCTTG AGAACTTCGA CGGTAGCTTC 840 

CGTGGTTCTG CCCAAGGTAT CGAAGGCTCC ATCAGGAGCC CACACTTGAT GGACATCTTG 900 

AACAGCATAA C?ATC?ACAG CGATGCTCAC AGAG5AGAG7 A7TACTGGTC TGGACACCAG 960 

AICA7GGCC7 C7CCAG7TGG A77CAGCGGG CCCGAG777A CC777CC7C7 C7A7GGAAC7 1020 

ATGGGAAACG CCGCTCCACA ACAACG7A7C G77GC7CAAC 7AGG7CAGGG TG7C7ACAGA 1080 

ACC77G7C77 CCACC7TG7A CAGAAGACCC 77CAATA7CG G7A7CAACAA CCAGCAAC77 1140 

TCCG77C77G ACGGAACAGA G7TCGCC7AT GGAACC7C77 C7AAC77GCC A7CCGC7G77 1200 

7ACAGAAAGA GCGGAACCG7 7GA77CC77G GACGAAA7CC CACCACAGAA CAACAATG7G 1260 

CCACCCAGGC AAGGA77C7C CCACAGG77G AGCCACG7G7 CCA7G77CCG T7CCGGA77C 1320* 

AGCaACAG77 CCG7GAGCA7 CArCAGAGC7 CC7A7G77C7 CATGGA77CA TCG7AG7GC7 1380^ 

GAG77CAACA A7A7CA77CC T7CC7C7CAA ATCACCZAAA TCCCA77GAC CAAG7C7AC7 1440 

AACC77GGA7 C7GGAAC77C TG7CG7GAAA GGACCAGGCT TCACAGGAGG 7GA7A77C7T IS 00 
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AGAAGAACTT 


CTCCrCGCCA 


GATTAGCACC 


C7CAGAG7TA 


ACA7CAC7GC 




1560 


CAAAGATATC 


GT6TCAGGAT 


TCS7TACGCA 


TC7ACCAC7A 


ACTTGCAATT 




1620 


ATCGAC3GAA 


GGCC7ATCAA 


TCAG6GTAAC 


TTCTCCGCAA 


CCATGTCAAG 


CGGCAGCAAC 


1680 


TTGCAATCCG 


SCAGC7TCAG 


AACCGTCSST 


TTCAC7ACTC 


CTT7CAACTT 


CTCTAACGGA 


1740 


TCaAGCGTTT 


TCaCCCTTAG 


CGCTCATGTG 


TTCaATTCTG 


GCAA7GAAGT 


GTACATTGAC 


1800 


CGTATTGAGT 


TTGTGCCTGC 


C6AA6TTACC 


TTCGAGGCTG 


AGTAC 




1845 



3 
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INDICATtONS REI^TTNG TO A DEPOSITED MICROORGANISM 

(PCTRuJc 13 to) 



A. ThexBdicaxioiisinadebeiawTeUteto 
oa page 159 



Allies 



letened to in the descnpuon 
21-25 




B- XDENTDFICATION OF DEPOSIT 
Name of deposiiary insdnitioa 



Funber deposits are idenu'fied on an additional sfaeet | | 




Addietf of depositary issticmiaa C'udlatUigpaaa i rod e a n d rm t i ir y) 



Peoriar H. 61604 

n.s.A. 



Date of deposit 

4 October (04.10.91) 



Accession Ntnnber 
HRRL B 18891 



AS}UTa01MLJSDlCX110m(lap^hia«^^ ll»is fawmution is cewiiimcd on an additi^ □ 



We request: t:he ^jm^ Solution vbere available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if (te 



are net for ^ desig n a ted Siittn i 



E. SEPARATE FURNISHING OF INDICATIONS {ImeblaakifnaappGablej 



Tbeiodicatioos listed below will besnbmitled to the Interaaiionai Bureau Izterixprnfyihegamiiiawra^ tbeuiMieoiioiae.g^ 'A 



For xeoeiving Office use only 



Tliis sheet was reoesved with tbeistenatiosalappiicasio& 



ABtbodzed ofEioer 



For lAtenatioiaj Bureau ose only 



tiiis sheet was received by tbelntetutional 



Bureau ofc 



Authonzed oOioer 



Fwm PCr;R0/I34 (July 1992) . 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(?CT Rule I3bis) 



A. The xndtcauom xiude below relate to the mkreorganB 
on oaee 159 . lines 


izn retened to in tbe descnptkm 
21-25 




B. IDENTIFICATION OF DEPOSIT 


Funher deposits are identified on an additional sbe 




Name of deposiury institution 

Agrldtltuzal Research CoXtore CoXXc 


sctixon (HBBIa} 





Address o f depositary insitmtioa ftncitiduig poaai code end cauay) 

18X5 Hortfa X3xi±vmra±cr Street 
Peorxa# XL 6X604 
U.S. A. 



Date ot* deposit 

4 October X991 (04.X0.9X) 



Accession Ntnnfoer 
HRRL B 18892 



C ADDITIONAL INDICATIONS O^hlamk^aei eppiieahi^ Tbis ia&mmtion is ^rimwl 



on an 



□ 



We request the Sip e xL SoXntxcm vhere avaiXabXi 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE OfthciiidkatioesmnaforaUdeuputt^Ss^ 



E. SEPARATE FURNISHING OF INDICATIONS (/anwAia^ 

Tbe indications listed below will besubmsned to tbe InteniatioiMl 
Number of Depaail') 



Bumu izttrispee^ihegmaniiimureeftheu^ieatiam€.p, 'Aceahm 



□ Ilia 



" For receivmg 0£Qoe use only 



sheet was received with the inienational appixation 



Authorized oflocer 



For Intenuttonal Burean use only 



n Tbis sheet was received by the intexnational Burean on: 



Attthonzed o£Dcer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuie I2bis) 



Am- UsemoicatiaosniadebeiowTeiaieto 



, lines 



retened to in tbe descnpuon 



B. IDENTIFrCATION OF DEPOSIT 



Fditfaer deposits are ide&tiGcd on an additional sbeet 



Name of depositary insutuiion 
AgricnltaraX Reeearch 



Culture Qollection (HBRL) 



Addxess of depositary 



aon (induing postal code atid caaiury) 



, XL 61604 



Dace of deposit 

3 September 1992 (03.09.92) 



Accession Number 
SRRL B 18994 



G. ADDmONALINDICATK)N50eMi;^^ju[ttf;p£^^ Tliis iafbimationiscantimiedasanadditiooalslieet Q 



116 xequegt the 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE 



(ifihe umeauamsoremptfor •& dapiateiSime^ 



£. SEPARATE FURNISHING OF INDICATIONS (Imehknkifttet app6eabl^ 



HieindicatioDS listed beiowwill besobmined to tbe intenationai Bureau Izterispedjyihegaiamiiatarciifthe 
NtmeerofD€posu') 



0£Coe use ooly 



\ I Hussbeetwasxeceivedwitfatbexntenatioiuia 



Autbcdzed o£C( 



For IntcntatioBai Bureau ooe only 



n Hiissbeet^vasreoBiwedby the Intenationai Bureau 



ofCi 



Fona ?CrjR0/n4 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MIC3100RGANISM 

(PCr Ruic 13bis) 



A. Tbe indications made beiow reiaie to the microarganisci referred to in the description 
onoace 159 , lines 21-25 


B, IDENTIFICATION OF DEPOSIT 


Further dei>osits are identified on an addittcnal six 


« □ 


Name of deposiury institution 

AgxicnXttiraX Researcb CaXtxire GoUoetij 


on (HRSL) 






Aikfaess of depositary institution fuiekuluigpottMieodemmdeauary) 






1815 Horth Qaiverslty Street 
Peoria, XL 61604 








Date of deposit 
3 Septeaber 1992 (03.09.92) 


Accession Number 
HBBL B 18995 






C ADDITIONAL INDICATIONS (Uami^^ma appGcable) Ibis inforniaticm is oontix 


oed on s& ii^ifft**iwf she 




Ife request the Sspert SoltttioB vbere ar 


vexlable 






D. DESIGNATED STATES FOR WmCH INDICATIONS ARE MADE Gftheindiaak 


MP are net for etf Jatgrnatfii Sitta) 




£. SEPARATE FURNISUING OF INDICATIONS (ZmiO^ 


The indications listed below will besttfamioed to the imemstianal niirr %\\ i i rrr ( rfunfr rtir jtm ami nmmw iff f fcr imHiniumi r j; , 'Arrrnkm 
NamoerofBeposit'^ 



For receiving 0£Qce use only 



I I This sheet was received with the iniernatiottai application 



Authorized officer 



For intemationai Bnrein ose only 



rn Tbis sheet was received by the intonatioiul Buieao 



Authorized ofCcer 



Fbnn PCr/R0/I34 (Juiy 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule Ubis) 




A. The indications made below reiaie co the mictoongaaisni retened to in ihe descnption 
on page 159 .lines 21-25 

B. IDENTIFICATION OFDEPOSH 
Name of deoositary insdtuiion 



Fonfaer deposits are identined on an additional sheet f**] 



Addxcss of depositary institatio& (tnHttdiiig pomlcodcand eatMry) 



(HRBL) 



Seoriar II* 61604 
II.S.A. 



Date of deposit 

4 Septnber 1992 (04.09.92) 



Accession Number 
5BBL B 18996 



C ADDIIIONAL INDICATIONS (UavehioBk^aa appiieahU) This infonnatioa a 




on an addttiooai sbeet 



D. DESIGNATED STATES FDR WHICH INDICATIONS ARE MADE (iftheia£eadoiumiwtfordil daiputedStae^ 



£. SEPARATE FURNISHING OF INDICATIONS (Imcblaakifttot eppSad^ 



The indications listed beicrnr will besobmitzed to the imenuiionai Bttreao iitaispcafythc^aitaniammei 
Namoer of Deposit') 



For r e cei ving OIEcb use ooiy 



I I This sheetwasxeceived with tbeiaiernationa] application 



Autfaocized ofljcrr 



□ 



IntemtioQaJ Bureaa ose only 
was recBfwed by tbe ImenutiaBal 




Pwm PCryRO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 12bis) 



A. Tbe indications made beiow reiate lo the miooorgantsm referred to tn the 
on page 159 .lines 21-25 



B. IDENTIFICATION OF DEPOSrr 



Ftmher deposits are identified on an additiooai sheet j~{ 



Name of deposiury institution 

Agricultural Besearch Ctilttzre fv>TT<»r^Tiot« (HRBL) 



Address of depositary institution (i 



paatai codeama cauaay) 



1815 Borth tXaxvenlty Street 

BeoriAr IL 61604 

n.S.A. 



Date of deposit 
21 September 



1992 (21.09.92) 



Number 
3 18997 



C ADDITIONAL INDICATTONS {leave hkmk^aet eppaeabU^ Dm infomntsoa » cantinued on 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCr Rnlc Ubis) 



A. Ibe indicapois made beiow rdate to the micn>orgamsm reta 
onoaee 159 .lines 21" 


Ted to in the description 
-25 


* 


B* IDENTIFICATION OF DEPOSIT 




Furtber deoostts are 

* 


identtCed an an addtUonaJ sheet J J 


Name of depositary insdmtion 

Agrlcnltural Research Oil rare GoUecta 


XX 


a (HBRL) 




Address of depositary instimtion (tncludiagpossci code and. caamryj 






1B15 Hbr^ DkixyersxtT Street 

Peozoa, H* 61604 

U.S.A. 








Date of deposit 

21 Septressber 1392 (21.09.92) 


/ 
I 


ioccssion ptutDoct 
IRBL B 18998 




O ADDITIONAL INDICATIONS (kamUaidcifMU appSeabl^ 


This informattoa is € 


ontiiiiiedaiianadditraDaisiieet | | 1 
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tUaUe 
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Number of Dmsu^ 



0£Coe use only 




Form PCr/RO/134 (July 1992) 



For IntematiooaJ B 



only 



llitt sheet ^x«i«ceii«d by the ImeraatiooalBureaii 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 



(PCT Ruic I2bis) 


A. Tbe indications made beiow relate to the r 
on paf^ 159 


ucroorganisin referred to in the description 
. lines 21-25 




B. IDENTIFTCATION OF DEPOSIT 


Funber deoosits are identified an an additiooi 


tishttx □ 


Nameot deposiury institution 






Agrlcaltnml Research COXtt: 


re CoUaction (HBSL) 





Address of depositary xnstitutioo (inei u dtn^ postal code a a rf rrmmr y) 

1BI5 56rth Oaxverslty St ree t 

FeordLa, Us 61604 

XT.S.A. 



Date of deposit 

21 Septnber 1992 (21*09.92) 



Accession Number 
L B 18999 



C ADDmONAL INDICATIONS (leave blamkif met appSeabi^ Tim infomiation is Goadnued on an additianal sheet Q 



We request the 'g'j** ''^ Solntloa ivhere available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE CtftbebidkaiwnmaaforttadeapimtdStala} 



£. SEPARATE FURNISHING OF INDICATIONS 



Tbe indications listed beiow wiU be suonined to the intemaciona 1 Bateatt later ispee^skefomiiiMOinefike 
Numoer of Deposit') 



Forreodving OfCoe use cmly 



n <^ect was received with the intenational application 



Auihonzed ofOcer 



Fomi PCryRO/134 (July 1992) 



For Intenntionai Bureau use only 



n This sheet was received by the IntenatiGnal Bureau ok 



Authorized ofiQoer 
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Whar is claimed is: 

1. A nudeoride sequence comprising a raaise optimized 
coding sequence wiiicii encodes a prorein capable of killing an 
insect:. 

2. A nucleoride sequence of claim 1, wherein said 
coding sequence encodes a 3.t. protein. 

3. A nucleotide sequence of claim 2, wherein said 
coding sequence encodes a CrylA(b) protein. 

4. A nucleotide sequence of claim 3^ wherein said 
coding sequence comprises Sequence 3^ Sequence 4^ or the 
sequence sec forth in Fig. 7. 

5. A nucleotide sequence of claim 3, wherein coding 
sequence encodes a CrylA(b) protein which is heat stable 
compared to a native CrylA(b) protein. 

o. A nucleotide sequence of claim 5^ wherein said 
coding sequence comprises a sequence selected from the grotip 
of secpiences consisting of Tig. 9, Fig. 11, Fig. 13 and Fig. IS. 

7. A nucleotide sequence of claim 2, wherein said 
coding sequence encodes a Cry IB or a Cr7lA(c) protein. 

8. A nucleotide sequence of claim 1, wherein said 
coding sequence comprises the sequence encoding a CrylB protein 
set forth in Fig. 6. 

9. A nucleotide sequence of claim 1, further comprising 
a first promoter capable of directing expression of a nucleotide 
sequence in a plant cell, operably linked to said coding 
sequence . 

10. A nucleotide sequence of claim 9, wherein said 
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promorer is capable of direcring expression of zhe associared 
coding sequence in a maize cell. 

11. k nucleoride sequence of claim 9^ wherein said 
promoter is selected from the group consisting of inducible 
promoters p constitutive promoters, temporal or 
developmentally-regulated promoters, tissue-preferred, and 
tissue-^specif ic promoters. 

12. A nucleotide sequence of claim 9, wherein said 
promoter is selected from the group consisting of a CaMV 3SS 
promoter, CaMV 19S promoter , a PEP carboxylase promoter, a 
pith-preferred promoter, and a pollen-specific promoter. 

13. A nucleotide sequence of claim 9, wherein said 
promoter is a pith-preferred promoter comprising the DNA 
sequence set forth in Fig. 24. 

14. A nucleotide sequence of claim 9, wherein said 
promoter is a pollen-specific promoter comprising the DNA 
sequence set forth in Fig. 35. 

15. A nucleotide sequence of claim 9, further 
comprising a second promoter capable of directing expression of 
an associated coding sequence in a plant cell, operatively 
linked to a second coding sequence. 

17, A nucleotide sequence of claim 15, wherein said 
second promoter is selected from the group consisting of 
inducible promoters, constitutive promoters, temporal or 
developmentally-regulated promoters, tissue-preferred, and 
tissue-specific promoters , 

18. A nucleotide sequence of claim 15, wherein said 
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second oromoter is s leered from the group consisting of a CaMV 
35S promorer, CaMV 19S promoter, a PEP cari^oxylase promoter, a 
pith-preferred promoter, and a pollen-specific promoter. 

21. A nucleotide sequence of claim 15, wherein said 
second coding sequence is a plant optimized coding sequence 
which encodes a protein capable of killing an insect. 

22. A nucleotide sequence of claim 21, wherein said 
second coding sequence encodes a 3.t , protein. 

23. A nucleotide sequence of claim 22, wherein said 
second coding sequence encodes a CrylA(b) protein. 

29. A nucleotide sequence of claim IS, wherein said 
second coding sequence is a marJcer gene. 

30. A recombinant vector, comprising a nucleotide 

sequence of claim. 9. 

31. A recombinant vector, comprising a nucleotide 
sequence of claim 9, wherein said coding sequence encodes a 
protein. 

32. A recombinant vector of claim 31, wherein the B.t. 
protein is a CrylA(b) protein. 

33. A recombinant vector, comprising a nucleotide 
sequence of claim 15. 

34. A recombinant vector, comprising a nucleotide 

sequence of claim 18. 

35 . A recombinant vector, comprising a nucleotide 
sequence of claim 22. 

3S. A plant stably transformed with a nucleotide 
sequence of claim 9. 
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37. A pianr srabiy cranszoraied with a nucieoride 

sequence of claim 12. 

38. A planr of claim 36, wherein said coding sequence 

encodes a 3 . t . prorein. 

39. A plant of claim 36, wherein said prorein is 
expressed in said plant in an amount sufficient to control 
Lepidopteran or Coleopteran insects. 

40. A plant of claim 38, wherein the B.t. protein is a 
Cry lA (b > protein . 

41. A plant of claim 38, which expresses the 3.t. 
insecticidal protein in an amount sufficient to control 
Lepidopteran or Coleopteran pests. 

42. A plant of claim 38 , wherein the amount is 
sufficient to control insects selected from the group consisting 
of European com borer. Sugarcane borer, stalk borers, cutworms, 
armyworms, rootworms, wireworms and aphids. 

43. A plant stably transformed with a nucleotide 
secTuence of claim 15 . 

44. A plant stably transformed with a nucleotide 
sequence of claim 18. 

45. A plant of claim 43, wherein the first and second 
coding sequences each encode a B.t . protein. 

46. A plant of claim 38, which expresses the B.t . 
insecticidal proteins in an amoxint sufficient to control 
Lepidoptersm or Coleopteran pests. 

47. A plant of claim 46, wherein the amount is 

sufficient to control stalk borers. 
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48. An isolated and purified promoter capable of 



directing pitii-preferred expression of an associated structural 



gene in a plant. 



49. 



A promoter of claim 48 r isolated from a monocot. 



50. 



A promoter of claim 48, isolated from a maize 



plant. 

51. A promoter of claim 48, isolated from a plant 



52. A promoter of claim SI, isolated from a maize 
tryptophan synthase- alpiia (TrpA) subunit gene. 

53. A promoter of claim 48, comprising the sequence set 
forth in Figure 24. 

54. A recombinant DNA molecule comprising a promoter of 
claim 48, operably associated with a structural gene encoding a 
protein of interest. 

55. A recombinant DNA molecule of claim 54, wherein 
said structural gene encodes an insecticidal protein. 

5o. A recombinant DNA molecule of claim 55, wherein 
said structural gene encodes a Bacillus thurinaiensis protein. 

57. A vector, comprising a recombinant DNA molecule of 
claim 54 . 

58. A vector of claim 57, wherein said structural gene 
encodes- an insecticidal protein. 

59. A vector of claim 57, wherein said structural gene 
encodes a Bacillus thurinaiensis protein. 

oO. A vector, comprising at least two recombinant DMA 
molecules of claim 54, wherein at least one of the two 
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strucrurai genes encodes an insecricidal protiein, 

51. A pianr sraJaiy cransfonned wicli recombinant: DNA 
molecule of claim 54 . 

52. A planr of claim 51, whicn is - a maize planr. 

53. A purified promoter capable of direcring 
pollen-specific expression of an associated strucrurai gene in a 
plant, wherein said promoter is isolated from a plant 
calcium-dependent phosphate Jcinase {CDPK} gene. 

64. A promoter of claim 63, isolated from a monocot CDPK 

gene. 

55. A promoter of claim 63 , isolated from a maize CDPX 

gene. 

56. A promoter of claim 65 r cos^rising the sequence set 
forth in Figure 35. 

67. A recombinant DHA molecule ^ comprising a promoter 
of claim 53 ^ operably associated with a structural gene encoding 
a prctain of interest. 

5B. A recombinant DNA molecule f of claim 67, wherein 
said structural gene encodes an ' insecticidal protein. 

69. A recombinant DNA molecule of claim 63 , wherein 
said structural gene encodes a Bacillus thur inaiens i s protein. 

70. A vector, comprising at least one recombinant DNA 
molecule of claim 67. 

71. A vector of claim 70, wherein said structural gene 
encodes an insecticidal protein. 

72. A vector of claim 71, wherein said structural gene 
encodes a Bacillus thurinaiensis protein. 
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73. A veccor of claim 70, comprising cwo recombinant: 
DMA molecules, wherein at least one of tlie two structural genes 
encodes an insecticidal protein* 

74. A plant stably transformed with; a recombinant DNA 
molecule of claim 57. 

75. A plant of claim 14, which is maize plant • 

76. A maize plant stably transformed with at least one 
recombinant DKA molecule; 

wherein said DNA molecule comprises a promoter 
operably linked to a nucleotide sequence encoding an 
insecticidal prorein. 

wherein said promoter is capable of directing 
tissue-preferred or tissue-specific expression of said gene in 
said maize plant. 

77. A maize plant of claim 76, wherein said gene 
encodes a Bacillus thurinaiensis protein. 

78. A plant of claim 76, wherein said promoter is 
obtained from a monocot. 

79. A plant of claim 78, wherein said monocot is 

maize. 

30. A muze plant of claim 76, wherein said promoter 
is capable of directing pith-preferred expression of said gene 
in said maize plant. 

31. A plant of claim 76, wherein said promoter is 
obtained from a plant tryptophan synthase-alpha sutaunit gene. 

82- A plant of claim 76, wherein said promoter is 
obtained from a maize tryptophan synthase- alpha subunit gene. 
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83. A pianr of claim 82, wherein said promocer 
comprises tlie sequence set f orrii in Figure 24 • 

84. A plant of claim IS, wherein said promoter is 
capable of directing green tissue-specific expression of said 
gene in said maize plant. 

35. A plant of claim 84, wherein said promoter is a 
PEP carboxylase promoter. 

86. A maize plant of claim 76, wherein said promoter 
is capable of directing pollen-specific expression of said gene 
in said maize plant. 

87. A maize plant of claim 76, wherein said promoter 
is obtained from a plant calcium-dependent phosphate kinase 
gene. 

38. A maize plant of claim 76, wherein said promoter 
is obtained from a maize calcium-dependent phosphate icinase 
gene. 

89. A maize plant of claim 76, wherein said promoter 
comprises the sequence set forth in Figure 35. 

90 . A method of producing a maize optimized coding 
sequence for an insecticidal 3.t. protein, comprising: 

determining the amino acid sequence of a predetermined 
insecticidal B.t, protein, and 

altering the coding sequence of the protein by 
substituting codons which are most preferred in maize for 
corresponding native codons. 

91. A method of protecting a maize plant against at 
least one insect pest, comprising: 
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srabiy transforming a maize plant witli at least on 
nucleotide secraence of claim 9, wherein tiie coding sequence 
encodes an insecticidal protein; whereby the transformed maize 
plant expresses the insecticidal protein in an amount sufficient 
to protect the plant against the pest. 

92. The method of claim 91, wherein said insecticidal 
protein is a 3-t . protein* 

93. The method of claim 91 , wherein said promoter is a 
tissue-specific or a tissue-preferred promoter. 

94 . A plant which, has been stably transformed with a 
nucleotide sequence wherein said nucleotide sequence comprises a 
maize optimized coding sequence for an insecticidal proteinp 
wherein said protein is expressed in said transformed plant at 
least 100 fold greater than expression of the protein using a 
native coding sequence. 
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T ^ 



BTHKURHD 
f Isvnftt . fin 
bssvn 



-BTHKURHD 

■ f Isvnbr . f in 
bssvn 



* 



20 
• 



30 



4^ 
* 



50 
* 



60 



ATGGATAACAATCCGAACATCAATGAATGCATTCCTTATAATTGTTTAAGTAACCCTGAA 

C C..C C..G C..C..C..C..CC.G,.C C..G 

C C..C C..G C..C..C,.C..CC.G..C C..G 



70 



80 



90 
* 



100 



110 



120 



GTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCAATCGATATTTCCTTG 

..G..Z.,GC.Q,.C,.C,.GC,C..C..G..C..C C C. CAG . C . . 

..G..G..GC.G..C..C..GC.C..C..G,.C. .C C C. .CAG.C. . 



BTHKURHD 

f Isvnbt • fin 
bssyn 



130 140 150 160 170 130 

* * * ♦ ♦ * 

TCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGACTA 

AGC, .G. .C. .G. .C. .GC C..G..C.,G C..C..C..C,..C.G.,C..G 

AGC. .G.,C..G,.C..GC.,.,C,.G..C..G C..C..C..C...C.G..C..G 



BTHKURHD 

f Isvnbt . fin 
bssyn 



190 



200 
* 



210 



220 



230 



240 



GTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 

..G..C..C..C C. ,C. .C. .C. . .AGC. .G C..C..G..G.,G..C 

..G..C..C..C C..C..C..C.. .AGC. ,G C. . C . . G . . G , . G . . C 



BTHKURHD 
f Isynbt . fin 
bssyn 



BTHKURHD 
f Isvnbt . f in 
wssvn 



250 260 270 280 290 300 

* « * * ♦ ^ 

GAACAGTTAArrAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTA 

. .G. . .C.G. .C GC.C..C..G..G CC.C G CAGCCCC^G 

. .G. . .C.G. .C GC,C..C..G..G CC.C G CXGCC.CC.G 



310 



320 



330 



340 



350 



360 



GAAGGACTAAGCAATCTTTATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGAT 

..G..C..G C..G..C C C. ,GAGC. .CC.C G. .C. .C 

..G. ,C. .3 C..G..C C C. . CAGC . . CC . C G . . C . . C 



BTHKURHD 
: Isynbt . fin 
bssyn 



370 380 390 400 410 420 

♦ ♦ * * * t 

CCTACTAATCCAGCATTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTGCC 

, .C. .C. .C. .CC.GC.C, .G C.C.G C C... 

,.C..C..C,.C.,CC.GC.C».G C.C.G C C... 



BTHKURHD 
f isynbt . f in 
bssyn 



430 440 450 460 470 480 

* ♦ * * » * 

CTTACAACCGCTATTCCTCTTTTTGCAGTTCAAAATTATCAAGTTCCTCTTTTATCAGTA 

.•G..C C..C..C..G..C; ,C,,G,.G..C..C..G..G,.C. .GC.GAGC. .G 

..G..C C. . C. C. . G,.C..C.,C.G. .C.C.G. .G..C. .GC.GkGC. ,G 



BTHKURHD 
f Isynbt . fin 
bssyn 



BTHKURHD 

f Isynbt. fin 
bssyn 



490 



500 



510 

« 



520 



530 

* 



540 



TATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAA 

. . C • G . . C . C . C . CC G . . CC GAGC . .GC. .C .C . .C . . CAGC C.C.G 

, .a, .G. .G. .C. .C. .CZ.G. ,CC. GAGC . .GC. ,C.C. .C. . CAGC C.C.G 



550 



560 



570 



580 



590 



600 



AGGTGGGGATTTGATGCCGCGACTATCAATAGTCGTTATAATGATTTAACTAGGCrTATT 

CC C.C.C C.C C.C.C.C.C.CCC.CCC.C.C 

CC C.C.C C..C....C.C.C..C.C.CCG..CCC.G..C 
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1220 



FIG. - ICGNT) 



1230 



124c 



1250 



1260 
* 



BTHKURHD 

f Isvnbt. f in 

bssyn 



3THKLTlhD 
'f Isynbt. f in 
bssyn - 



TACAGAAAAAGCGGAACGGTAGATTCGCTGGATGAAATACCGCCACAGAATAACAACGTG 

• • .C*C. aG* • • • m Q ^ aC* aG* • CAGC . .C««G«.C««C«.T**a*>C.*«..«... 

. .-C.C.G C.,C--G.. CAGC C..G..C..C..T C 

1270 1280 1290 1300 1310 1320 

» ' * « . * * ^ 

CCACCTAGGCAAGGATTTAGTCATCGATTAAGCCATGTTTCAATGTTTCGTTCAGGCTTT 

C,A..G,,C..C..C..C.-TC,G C. .GAGC. • . . .C, . CAGT C 

C.A..G..C..C..C.,C..TC.G C. .GAGC C. .CAGT C 



37HKURHD 
f Isynbt. f in 
bssyn 



1330 
* 



1340 



1350 



1360 



1370 



1380 



AGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCTTGGATACATCGTAGTGCT 

..C..C,.C..C..G..C.,C,.CC.T..A AGC T. .C, .C C 

..C.-C..C,.C..G..C..C..CC.T..A AGC T. .C. .C C 



3THKUimD 

f isynbr . t in 
bssyn 



1390 



1400 



1410 
* 



1420 
* 



1430 



GAATTTAATAATATAATTCCTTCATCA--CAAATTACACAAATACCTTTAACAAAATCTA 
..G..C..C..C,.C,.C.. — ..G..GC,,G..C..C,.G..C,.CC.G..C.. GAGC . 
. - G. .V*. .u. .Ca aC. (C* .G. . G C . .G. .C. .C* .G. .C*. CC • 'u • • C . . GAGC • 



1450 



1460 



i 1 • - 



1480 



3THKURHD 

f Isynbt. f in 
bssyn 



CTAATCTTGGCTCTGGAACTTC7GTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTC 
.C. .C. .G. . .AGC. .CAGC. .G. .G. .G. .C. -C. .C. .C. .C. .C. .C. .C. .C. 
, C. , C . . G . . . AGC . . C . . CAGC ..G..G..G..C..C..C,.C..C..C,,C..C..C, 



1500 1510 1520 1530 1540 1550 

* * * * J * ♦ 

BTHKURHD TTCGAAGAACTTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATT.aiCTGCACCATTAT 

f Isynbt . f m .G. ,CC,C. . CAGC . . C CAGC . . . C . GC . C . . G . . C • . C . . C . . C. . CC . GA 

bssyn . G . . CC . C . . CAGC . . C CAGC . . .CZC.C. . G . . C . . C, . G . . Z . . CC . GA 



bssyn 



1560 1570 1580 1590 1600 1610 

♦ * « * ♦ 

CACAAAGATATCGGGTAAGAATTCGCTACGCTTCTACCACAAATTTACAATTCCATACAT 

CAGC C . . CC . G . . G C , . CA 



3THKURHD 

f Isynbr. .fin GC . . GC . C . . C . . C . . CC . C . . C 



GC . . GC . C , . C . . C . . CC . C . . C CAGC C . . CC . G . . G C . . CA 



1620 1630 1640 1650 1660 1670 

* ♦ * ♦ * ★ 

3THKURHD CAATTGACGGAAGACCTATTAATCAGGGGAATTTTTCAGCAACTATGAGTAGTGGGAGTA 

^ f Isynbt .fin GC . . C CC . C C C C ..... C C . . CAGC •.C.,C C..C..C..C. 

ibssyn GC..C CC.C..C..C..C C.C. CAGC ..C..C C.,C..C..C. 



1680 1690 1700 1710 1720 1730 

3THKURHD ATTTACAGTCCGGAAGCrrTAGGACTGTAGGTTTTACTACTCCGTTTAACTTTTCAAATG 

f Isynbt .fin . CC . G ... AG ... C CC . C . . C . . G . . C . . C . . C . . C . . C . . C CAGC . . C . 

bssyn .CC.G.,.AG..-C CC.C. .C. .G . .C. .C . .C . .C. .C. .C CAGC. .C. 

«^ 

1740 1750 1760 1770 1780 1790 ■ 

* * * « » ♦ 

BTHKURHD GATCAAGTGTATTTACGTTAAGTGCTCATGTCTTCAATTCAGGCAATGAAGTTTATATAG 

f Isynbt. fin .CAGC, ,C..G.,C..CC,G. .C..C..C. .G CAGC C..G..G..C..C. 

bssyn .CAGC. .C..G..C..CC.G..C.,C..C..G CAGC C..G..G.,C,.C. 
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FIG. 1 ICONT) 

1800 JIO 1820 1830 ^^40 1350 

3THKURHD ATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAAGAG 

f Isynbt .fin .Z,.C,.C..G..C.,G,.C..C,.G..G C C..G..C..CC.G...G,.G. 

bss yr. ..C. ,C, .G , .Z. .G..C. .C..G. .G C C..G. .C..CC.G..G..G. 

1860 1870 1880 1890 1900 1910 

3THKURHD CACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATG 

f isynbt. fin .7. .G C C C. .CAGCAG. ..C.G ^^'G. .G . .C. .C 

OSSyn .T..G C C C. .CAGCAG. . .C. -G CC . G . , G ! ! C ! ! C ! 

1920 1930 1940 1950 1960 1970 

* * * ♦ ♦ ♦ 

3THKURHD TGACGGATTATCATATTGATCAAGTATCCAATTTAGTTGAGTGTTTATCTGATGAATTr^ 

f Isynbt. fin C..C..C..C..C G. .GAG, . .CC.G. .G CC.GAGC. .C. .G. .c! 

bssyn ..,.C..C..C..C..C.... 

1980 1990 2000 2010 2020 2030 

3THKURHD GTCTGGATGAAAAAAAAGAATTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATG 

f Isynbt. fin .C Z. .G. .G. ,G. .GC. .AG G. .G. .G, ,Z. ,Z C.\G. .Z. .Z 

bssyn 



3THKURHD 
f Isynbt . 
bssvn 



3:HKL71HD 
f Isvnbt . 
bssvn 



40 2050 2060 


2070 


2080 2090 






♦ ♦ 


AGCGGAATTTACTTCAAGATCCAAACTTTAGAGGGATCAATAGACAACTAGACCGTGGCT 








00 2110 2120 
* * « 


2130 


2140 2150 


GGAGAGGAAGTACGGATATTACCATC 


« 

C.\AGGAGGCG 


ATG A CG T ATT CAAAG AG AATTACG 









2160 2170 2180 2190 2200 . 2210 

****** 

3THKURHD TTACGCTATTGGGTACCTTTGATGAGTGCTATCCAACGTATTTATATCAAAAAATAGATG 

f Isynbt, fm .G..C..GC....C C..C Z . .Z . ,Z . .ZZ .G , . Z . .G . . G . ,Z . Z 

bssyn , 



2220 2230 2240 2250 2260 2270 

3THKURHD AGTCGAAATTAAAAGCCTATACCCGTTACCAATTAAGAGGGTATATCGAAGATAGTCAAG 

f^synDt.fin . . AGC. .GC.G. .G C C GC.GC.C. .C. .C G. .C. ,C. .G. 

^ssyn 

2280 2290 2300 2310 2320 2330 

* * * ♦ « « 

3THXURHD ACTTAGAAATCTATTTAATTCGCTACAATGCCAAACACGAAACAGTAAATGTGCCAGGTA 

f Isynbt.. fin ..C.G..G CC.G..C C G G. .Z. ,G. .Z C..C. 

bssyn -1-1111-11.1 

2340 2350 2360 2370 2380 2390 • 

♦ « * # * ♦ 

3THKURHD CGGGTTCC1TATGGCCGCTTTCAGCCCCAAGTCCAATCGGAAAATGTGCCCATCATTCCC 

f Isynbt . fin .C. .CAG.C.G. . . . .C. .GAGC C. .C. .C C. .G. .C. . . . .,C. .CAG 

bssyn 11111.11 .1* 
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r - o . i t t-uN i ) 

2400 ^.10 2420 2430 2^40 2450 

^ It * It « « 

BTHKURHD ATCATTTCTCCTTGGACATTGATGTTGGATGTACAGACTTAAATGAGGACTTAGGTGTAT 

flsynbt.fin .C..C...AG.C C..C.,G,.C..C..C.,.C.G,.C C.G . .C. .G, 

bssyn ' ^ . 

'J. 

2460 2470 2480 2490 2500 2510 

« ^ « * ♦ » 

BTHKURHD GGGTGATATTCAAGATTAAGACGCAAGATGGCCATGCAAGACTAGGAAATCTAGAATTTC 

*f Isynbt .fin C C C«.G..C C,.CC.C,.G..C..C,.G..G..C. 

bssyn . ll-ll.l 

2520 2530 2540 2550 2560 2570 

* * * * # « 

BTHKURHD TCGAAGAGAAACCATTAGTAGGAGAAGCACTAGCTCGTGTGAAAAGAGCGGAGAAAAAAT 

flsynbt.f in .G, .G G..CC.G.,G. .C..G..C. .G. .C. .C GC.C..C G.,G. 

bssyn 11-1 

2580 2590 2600 2610 2620 2630 

****** 

BTHKURHD GGAGAGACAAACGTGAAAAATTGGAATGGGAAACAAATATTGTTTATAAAGAGGCAAAAG 

flsynbt .f in ..C.C G. .C. .G. .GC G G . ,C. .C. .C, .G , .C . .G C..G 

bssvn 1 ' 



2640 2650 2660 2670 2680 2690 

**«**« 

BTHKURHD AATCTGTAGATGCTTTATTTGTAAACTCTCAATArGATAGATTACAAGCGGATACCAACA 

f Isynbt. fm .GAGC. .G-,C..CC.G..C..G.. .AGC. . G . . C . . CC , CC . G - . G . . C . . C 

bssvn 1 * * ' 

2700 2710 2720 2730 2740 2750 

****»« 

BTHKURHD TCGCGATGATTCATGCGGCAGATAAACGCGTTCArAGCATTCGAGAAGCTTATCTGCCTG 

flsynbt.. fin C C..C.,C..C..C..G G..C C. .G. ,C. .G c, 

tssvn 11 ' ' ' 

2760 2770 2780 2790 2800 2810 

* * * « * * 

BTHKURHD AGCTGTCTGTGATTCCGGGTGTCAATGCGGCTATTTTTGAAGAATTAGAAGGGCGTATTT 

flsynbt. fin AGC ^C. .C, .G. .C. .C. .C. ,C, .C. .G. .GC.G. .G. .C. .C. .C 

i^ssyn 1.11.1 

2820 2830 2840 2850 2860 2870 

****** 

BTHKURHD TCACTGCATTCTCCCTATATGATGCGAGAAATGTCATTAAAAATGGTGATTTTAATAATG 
flsynbt .fin . . . .C, .C, .AG,..G..C. .C. .CC.C. .C. .G. .C. .G. .C. ,0, .C. .C.C. .C. 
Ibssyn 1-11-11-11-11^1 

2380 2890 2900 2910 2920 2930 

*♦***♦ 

BTHKURHD GCTTATCCTGCTGGAACGTGAAAGGGCATGTAGATGTAGAAGAACAAAACAACCACCGTT 

f Isynbt. fm . .C.GAG G..C..C..G..C..G.,G..G..G CA 

bssyn H 

2940 2950 2960 2970 2980 2990 • 

• * • • » 

BTHKURHD CGGTCCTTGTTGTTCCGGAATGGGAAGCAGAAGTGTCACAAGAAGTTCGTGTCTGTCCGG 

flsyntit.fin GC..G..G..G..G..C..G G..C..G.. .AGC. .G..G..G..C..G..C..C. 

bssyn " 
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FIG. i ICONT) 

3000 jwiO 3020 3030 3UtO 3050 

* « ♦ ♦ * « 

3THKURHD GTCGTGGCTATATCCTTCGTGTCACAGCGTACAAGGAGGGATATGGAGAAGGTTGCGTAA 

flsynbt.fin .C..C C g. .C, .G. .C. .C C..C..C..G..C G. 

bssvn • — 

3060 3070 3080 3090 . 3100 3110 

* * * ♦ ♦ » 

^THKURHD CCATTCATGAGATCGAGAACAATACAGACGAACTGAAGTTTAGCAACTGTGTAGAAGAGG 

isynbc- f in ....C.C C.C G..C C C.,G,.G.... 

bssyn — — « —llll 

3120 3130 3140 3150 3160 3170 

* 4 * * « 1^ 

3THKURHD AAGTATATCCAAACAACACGG7AACGTGTAATGATTATACTGCGACTCAAGAAGAATATG 

f Isynbt.f in .G..G.,C..C C..G..C-.C..C..C..C..C.,C,.C..G,,G..G..C, 

bssvn 11-1 

3180 3190 3200 3210 3220 3230 

* ★ * ♦ * n 

3THKURHD AGGGTACGTACACTTCTCG7AATCGAGGATATGACGGAGCCTATGAAAGCAAT7CTTCTG 

f Isynbt . f in C.C CAGC. .0.. C.C. .C.C C C.G CAGCAGC 

bssvn 

324C 3250 3260 3270 3280 3290 

* * « it ■ -k « 

3THKURHD TACCAGCTGATTATGCATCAGCCTATGAAGAAAAAGCATATACAGATGGACGAAGAGACA 
f Isynbt. fin .G-. C.C. C.C .CAGC... ..•C,.G..G..G..C..C..C..C..C..CC.C..., 
bssyn 1-1111 

3300 3310 3320 3330 3340 3350 

* « * * ■ « 

3THKURHD ATCCTTGTGAATCTAACAGAGGATATGGGGATTACACACCACTACCAGCTGCCTATGTGA 

f Isynbt.f in .C .C .C .GAGC . .CC .C.C .C .C C .C .G. .C.C C. . 

bssvn 11111_11' 

3360 3370 3380 3390 3400 3410 

* * ♦ ♦ * * 

3THKURHD CAAAAGAATTAGAGTACTTCCCAGAAACCGATAAGGTATGGATTGAGATCGGAGAAACGG 

f Isynbt. fin .C.G..GCG C.G C G C c G r 

bssyn . 

3420 3430 3440 3450 3460 

***** 

3THKURHD AAGGAACATTCATCGTGGACAGCGTGGAATTACTTCTTATGGAGGAATAA 

: Isynbt. fin .G. .C .C . . .GCG. .c .G c .G 

bssyn . . .TG. .G 
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BTHKURHD 

bssyn 



1 w 



.20 30 4l 50 60 

««♦*•* 

ATGCATAACAATCCGAACATCAATGAATGCATTCCTTATAATTGTTTAAGTAACCCTGA^ 
C C..C C..G C..C..C..C..CC.G..C C.-G 



BTHKURHD 
iissyn 



BTHKURHD 
bssyn 



70 



80 



90 



100 



110 



120 



GTAGAAGTATTAGGTGGAGAAAGAATAGAAAC7GGTTACAC! 
. .G. •G, .GC.G* .C. .C. .GCC. .C. .G. .C. .C 



:CAATCGATATTTCCTTG 
- C C . . GAG • C. . 



130 

TCGCTAACGCAA 
AGC, .G. .C. .G 



140 150 160 

* * * 

CTTTTGAGTGAATTTGTTCCCGGTGCTGGAT 
C..GC-...C..G..C..G C..C..C 



170 180 
* • 

rTGTGTTAGGACTA 
C. . .C.G. .C. .G 



BTHKURHD 

bssyn 



190 200 210 220 230 240 

***** ^ 

GTTGATATAATATGGGGAATrrTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 
..G..C..C..C C.CC.C. .AGC. ,G C..C..G..G..G..C 



BTHKURHD 
bssvn 



250 260 270 280 290 300 

« * * * * « 

GAACAGTTAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTA 
.C.G..C GC-C..C..G..G CC.C G CAGCC.CCG 



G 



BTHKURHD 
bssyn 



310 



320 
* 



330 
* 



340 
* 



350 
* 



360 
* 



GAAGGACTAAGCAATCTTTATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGAT 
. .G. .C,.G C, .G. .C C C. . GAGC , . CC . C G . , C . . C 



BTHKURHD 
bssyn 



370 

* 



380 
* 



390 
* 



400 
* 



410 
♦ 



420 



:CTACTAATCCAGCATTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTGCC 
.C, .C. .C, .C. .CC.GCC. .G ,C. .C, .G ,C C. • . 



BTHKURHD 
bssyn 



430 440 450 460 470 480 

***♦*♦ 

CTTACAACCGCTATTCCTCTTTTTGCAGTTCAAAATTATCAAGTTCCTCTTTTATCAGTA 
..G..C C..C..C..G..C,.C..G..G..C..C.,G..G..C..GC. GAGC. . G 



BTHKURHD 
bssyn 



BTHKURHD 

bssyn 



490 500 510 520 530 540 

♦ * * * * * 

TATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAA 
. . C. . G . . G • . C . . C . . CC • G . • CC . GAGC . . GC . . C • C - . C . * CAGC C..C..G 

550 560 570 580 590 600 

t * * ♦ * ^ 

AGGTGGGGATTTGATGCCGCGACTATCAATAGTCGTTATAATGATTTAACTAGGCTTATT 
C,C C..C*.C C.C C..C..C..C..C..CC.G.,CC,C*.G..C 



BTHKURHD 
bssyn 



610 
* 



620 
* 



630 
* 



640 
* 



650 
« 



660 
* 



GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGCATTAGAGCGTGTATGGGGA 
........C..C,.C..C..C..G C.C.CC.G C..G T 



BTHKURHD 
bssyn 



670 



680 
* 



690 
* 



700 
* 



710 
* 



720 



CCGGATTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTA 
. .C. .CAGCC.C. .C C-.G..C..C,,G..CC.CC.C.,GC.G..C..G..C..G 
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FIG. : (CONTl 



BTHKURHD 
f Isynbt. fin 
bssyn 



3THKURHD 
''f IsvnDt . fin 
bssvn 



6x0 620 630 64o 650 660 

« * « * * « 

GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGA 

C. .C. .C. .C. .C. C. ,C. .CC.G C.,G T 

C..C..C..C..C..G C..C..CC.G C..G T 



670 680 690 700 710 720 

« * * ' * * if 

CCGGATTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTA 
. . C . . CAGCC .C. ,C C. .G. .C, .C. .G, .CC.CC.C. .GC*G. .C. .G. .C. .G 

• • G • • GAGCG aO* vO* • • • aG* ^G* aO* aO* aGa • OG • GG • G • a <jG aGa aGa aG* aGa cG 



BTHKURHD 

f Isynbt. fin 
bssyn 



730 



740 

it 



750 



760 



770 



780 



TTAGATATCGTTTCTGTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTT 

C.Ga aC GAGC. .G..C..C C..C..CC.C..C..C..C.,C..G..C..G 

C.G. .G GAGC. .G..C..C C..G..GCaC..C,.Ca.Ca.C,.G..C..G 



BTHKURHD 
flsvnbt . fin 
bssvn 



790 



800 



310 



820 
★ 



830 



840 



TCCCAATTAACAAGAGAAATTTATACAAACGCAGTATTAGAAAATTTTGATGGTAGTTTT 
kG, . .GC.G. .CC.C. .G CaaC G..GC.Ga.G..C..Ga.C..C..C..C 

AGa..GC-G..CCaG..G G,.G C..GGaGa.G..Ca.C..C..G..Ca.C 



BTHKURHD 

f Isynbr . fin 
bssyn 



850 
* 



36C 
« 



370 



380 



390 
♦ 



900 



CGAGGGTCGGCTCAGGGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTT 

. a G a a a AGC a.C C..G,.G.aC-aCC.C..C.,C.aCC C.aC.G 

a aC. a a AGC, aC C , . G . . C. , C . . CC, C . . , C , . CZ C.C.G 



BTHKURHD 
flsvnbt .fin 
bssyn 



910 920 930 940 - 950 960 

« * ♦ ♦ *^ ♦ 

AACAGTATAACCATCTATACGGATGCTCATAGAGGAGAATATTATTGGTCAGGGCATCAA 

C..C C..C..Ca.C..CC.C..CaaG..C.aC... AGC . a C . a C a a G 

Ca.C C..C..C..C..CG.Ca.C..G..C..C... AGC . . C . . C . . G 



BTHKL^RHD 
f Isynbt. fin 
bssvn 



970 . 980 990 1000 1010 1020 

* « * * * ♦ 

ATAATGGCTTCTCCTGTAGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATCGAACT 

. a C GAGC a . C . a C. a G a a CAGC a aC. .C. -G C. aCaaC. .G. aCaC. aC 

a . C GAGC a a C a . C a . C a a CAGC a, C. a C.G C..C..C.aG.aC..C..C 



BTHKURHD 
f Isynbt . fin 
bssyn 



1030 1040 1050 1060 1070 1080 

♦ * ♦ * * # 

ATGGGAAATGCAGCTCCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGA 

C. . C. . Ta. A. a Ta.G. a Ga aC.aCaaG.. A. .G.aG..C A CCC 

C..C..T,.Aa.Ta.G..G..C..C..G..A..G.aG.aC A CC^C 



BTHKURHD 

f Isynbt, fin 
bssyn 



1090 



1100 



1110 
* 



ACATTATCGTCCACTTTATATAGAAGACCT 

. aCCaGAGCAG. a .CC a G. a CC.TCa a , 
. a CC a GAGCAG - a a CC a G a a CC a TC a a 



1120 1130 1140 

♦ ♦ ♦ 

AATATAGGGATAAATAATCAACAACTA 

aCa .Ga aC* aCa aCa aCa aCa aG* vG* aG 
.C.aCa.C.aCaaC..C..Ca aG.aG.aG 



BTHKURHD 
f Isynbt. fin 
bssyn 



1150 1160 1170 1130 1190 1200 

* * • « « ^ 

TCTGTTCTTGACGGGAGAGAATTTGCTTATGGAACCTCCTCAW^TTTGCCATCCG^^ 

AGC a .G..G C.-CaaGa.C.aCaaC.aCa. a AG . AGC a aCC CAG. ..C.G 

AGC. .G..G C..C..G.,Ca.Ca.C..C.a a AG. AGC a .CC. a . a CAG a ..C.G 
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BTHKURiiD 
bssyn 



BTHKURHD 
.bssyn 



BTHKURHD 
bssvn 



FIG. : ICONT) 

740 750 76- 770 780 

« « * • « 

TTAGATATCGTTTCTCTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGXACAGTT 
C.G. .C GAGC. .G..C-,C C..C..CC.C..C,.C..C..C,.C.,C..G 



790 
* 



300 310 820 830 340 

* ♦ * ♦ ♦ 

TCCC.\ATTAACAAGAGAAATTTATACAAACCCAGTATTAGAAAATrrTGAT^ 

G C. .C .QC.Q. .G. .C. .C.,C. .C. .C 



AG 



GC.G 



850 860 870 880 390 900 

* * • « « « 

CGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTT 
. . C . . . AGC ..C C..G,.C,.C..CC.C..C..C..CC C.C.G 



3THKURHD 
bssyn 



3THXURHD 
bssvn 



910 920 930 940 950 960 

t * ♦ * ♦ t 

AACAGTATAACCATCTATACGGATGCTCATACAGGAGAATATTATTGGTCAGGGCATCAA 
C..C C-.C..C..C. •CC.C.C. .G..C, .C. ,AGC. .C. .C. .G 



970 980 990 1000 

t * « ♦ 

AT AATGGCTTCTC CTG T AGGG7TTTCGGGGC C AG AATTCACT1 
. . C CAGC . . C . . C , . C . • CAGC .•C..C..G C 



1010 1020 

TCCGCTATATGGAACT 
C. .C. .G. .C. .C* .C 



BTHKURHD 
bssyn 



BTHKLTlMD 
bssvn 



1030 1040 1050 1060 1070 1080 

ATGGGAAATGCAGCTCCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGA 
C.-C..T..A..T..G..G..C..C..G..A..G..G-.C A CC.C 



ACA 



1090 liOO 1110 1120 1130 1140 

****** 

TTATCGTCCACTTTATATAGAAGACCTTTTAATATAGGGATAAATAATCXACAACTA 
C . GAGCAG ...CC.G..CC.TC C,,C..C..C.,C,,C,.C,.C..G..a 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



1150 1160 1170 1180 1190 1200 

****** 

TCTGTTCTTGACGGGACAGAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTA 
AGC . .G. .G C - .C. - G, .C. ,C, .C - .C. • ,AG- AGC . . CC . . . . CAG . . , C - . G 

1210 1220 1230 1240 1250 1260 

* * * * * * 

TACAGAAAAAGCGGAACGGTAGATTCGCTGGATGAAATACCGCCACAGAATAACAACGTG 
,..C.C,.G C..C..G.. CAGC C..G..C,.C..T C 



BTHKURHD 

bssyn 



BTHKURHD 

bssyn 



BTHKURHD 
bssyn 



1270 
* 



1280 
* 



1290 

Ik 



1300 
* 



1310 
« 



1320 
* 



CCACCTAGGCAAGGATTTAGTCATCGATTAAGCCATGTTTCAATGTTTCGTTCAGGCTTT 
C.A..G-.C..C..C..C,,TC.G.....C. .GAGC C. . CAGT C 



1380 



1330 1340 1350 1360 1370 

• ***•* 

AGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCTTGGATACATCGTAGTGCT 
..C..C..C,.C..G,.C..C..CC.T..A AGC T,.C..C C 



GAA 
. .G 



1390 1400 1410 1420 1430 

* • V * * 

rAATAATATAATTCCTTCATCA — CAAATTACACAAATACCTTTAACAAAATCTA 
C..C..C..C..C.. — ,.G..GC,.G..C..C..G..C..CC.G..C.. GAGC . 
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3THKURHD 
bssyn 



3THKURHD 

^ssyn 



BTHKURHD 
bssvn 



FIG. 2 iCQNT) 

1440 ih50 1460 1470 l*r<jO 1490 

CTAATCTTGGCTCTGGAACTTCTG7CGTTAAAGGACCAGGATTTACAGGAGGAGATATTC 
,C. .C. .G. . .AGC. .C. .CAGC. .G . . .G . .C . .C . .C. .C. . C . . C . .C. 

1500 1510 1520 1530 1540 1550 

t t ♦ ♦ * * 

TTCGAAGAACTTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATTACTGCACCATTAT 
.G. .CC.C. .CAGC. .C CAGC. . ,C.GC.C. .G. .C. .C. .C. . C . .CC.GA 



15 



60 1570 1580 1590 1600 1610 
* * ♦ ♦ ♦ * 
CACAAAGATATCGGGTAAGAATTCGCTACGCTTCTACCACAAATTTACAATTCC^^ 
GC. .GC.C. .C, .C. .CC.C. ,C CAGC C. .CC.G. .G C. -CA 



STHKLl^D 
bssyn 



BTHKLUHC 
bssyn 



BTHKURHD 
bssyn 



3 T HXL ru" D 
bssvn 



1620 1630 1640 1650 1660 1670 
« • ♦ * * * 
CAATTGACGGAAGACCTATTAATCAGGGGAATTTTTCAGCAACTATGAGTAGT^ 
GC..C CC. C..C..C.,C C..C. - CAGC. .C..C C.C.C.C. 



1680 1690 1700 1710 1720 1730 

AT^TACAGTCCGGAAGCTTTAGGACTGTAGGTTTTACTACTCCGTTTAACTTTTCAAATG 
.CC.G...AG,..C CC.C..C..G..C..C..C..C..C..C CAGC . . C . 

1740 1750 ' 1760 1770 1780 1790 

t * * * ♦ * 

GATCAAGTG7ATTTACG7TAAG7GCTCATGTCTTCAATTCAGGCAATGAAGTTTATATAG 
.CAGC. .C. .G. .C. .CC.G. .C. .C. ,C. .G CAGC C..G.,G..C..C. 



1300 1810 1320 1830 

♦ * * * 

ATCGAATTGAATTTGTTCCGGCAGAAGTAACCT 

.C. .C, .C. .G. -C. .G, .C. .C. .G. .G. 



1840 
* 



1850 



rTGAGGCAGAATATGATTTAGAAAGAG 
C C..G..C..CC.G..G..G. 



3THKURHD 
bssyn 



1360 



1370 



1380 



1890 



* * « « 



1900 
* 



1910 



CACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATG 
. T . . G C • . . . • C C . . CAGCAG . . . C . . G CC . G . . G . • C . . C . 



BTHKUTIKD 
bssyn 



1920 1930 1940 
« » ♦ 
TGACGGA7TATCATATTGATCAAGTATCC 
, . . .C. ,C. .C. .C. .€ G-.G.AG 
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synlT . mze 
bssyn 

svnfui . nod 



syniT. mze 
'bssyn 
synful .mod 



syniT . nze 
bssyn 

svnfui . nod 



synlT. nze 
bssvn 

svnfui . nod 



synlT.nze 
bssyn 

synful .nod 



S V" ^ . .7.2 S 

bssyn 

s -T. r u 1 . nod 



synlT.mze 
bssyn 

synful .nod 



synlT.nze 
bssyn 

synful .mod 



synlT. nze 
bssyn 

synful .mod 



synlT. mze 
bssyn 

synful .mod 



FIG. 3 

20 30 4o 50 60 

* * ★ * » » 

ATGGACAACAACCCCAACATCAACGAGTGCATCCCCTACAACTGCCTGAGCAACCCCGAG 

70 80 90 100 110 120 

* * ♦ . ♦ -» if 

GTGGAGGTGCTGGGCGGCGAGCGCATCGAGACCGGCTACACCCCCATCGACATCAGCCTG 

130 140 150 160 170 130 

* * * « * » 

AGCCTGACCCAG7TCCTGCTGAGCGAGTTCGTGCCCGGCGCCGGCTTCGTGCTGGGCCTG 
******>•■*•*■*•••••••••••■•■■••*••••■••■•■••■•■«.••* 

190 200 210 220 230 240 

* * « * « * 

GTGGACATCATCTGGGGCATCTTCGGCCCCAGCCAGTGGGACGCCTTCCTGGTGCAGATC 

250 260 2':'C 280 290 30C 

* * * « * « 

GAGCAGCTGATCAACCAGCGCATCGAGGAG7TCGCCCGCAACCAGGCCATCAGCCGCCTG 

r 

310 320 330 340 350 360 

* * * « * ♦ 

CAGGGCCTGAGCAACCTGTACCAGATCTACGCGGAGAGCTTCGGCGAGTGGGAGGCCGAC 
A 

A 

370 380 ' 390 400 410 42C 

« ♦ « « » ♦ 

CCCACCAACCCCGCCCTGCGCGAGGAGATGCGCATCCAGTTCAACGACATGAACAGCGCC 

430 440 450 460 470 480 

* * ♦ ♦ « « 

CTGACCACCGCCATCCCCCTGTTCGCCGTGGAGAACTACCAGGTGCCCCTGCTGAGCGTG 

490 500 510 520 530 540 

* * ♦ ♦ * ♦ 

TACGTGCAGGCCGCCAACCTGCACCTGAGCGTGCTGCGCGACGTGAGCGTGTTCGGCCAG 
• • • C . . . 

550 560 570 580 590 600 

* * • • V , 

CGCTGGGGCTTCGACGCCGCCACCATCAACAGCCGCTACAACGACCTGACCCGCCTGATC 
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FIG, I ICGNT) 

tj^O 620 630 6*fo 650 660 

synir.nze ggcaactacaccgaccacgccgtgcgctggtacaacaccggcctggagcgcgtgtggggc 

bssyn * ^ 

svnf ui . ."Od 

670 680 690 700 710 720 

★ ♦ * ♦ * • 

svniT.nize CCCGACAGCCGCGACTGGATCCGCTACAACCAGTTCCGCCGCGAGCTGACCCTGACCGTG 

bssvn A.G 

synf ui .3K3d A.G 

730 740 750 760 770 780 

« « * * ♦ » 

synlT . nize CTGGACATCGTGAGCCTGTTCCCCAACTACGACAGCCGCACCTACCCCATCCGCACCGTG . 

bssvn • • • 

synful..T.cd 

790 800 310 820 830 840 

♦ WW*** 

svnlT . r.ze AGCCAGCTGACCCGCGAGATCTACACCAACCCCGTGCTGGAGAACTTCGACGGCAGCTTC 

bssvn r 

3vnful.roG T 

350 S60 370 330 390 90C 

* ♦ « * « • 

synlT . r.ze CGCGGCAGCGCCCAGGGCATCGAGGGCAGCATCCGCAGCCCCCACCTGATGGACATCCTG 

bssyn • • • • 

synf ul. nod 

910 920 930 940 950 96C 

****** 

syniT . r.Z3 >.ACAGCATCACCArCTACACCGACGCCCACCGCGGCGAGTACTACTGGAGC33CCACCA3 

bssvn 

svnf ul . r-- 

970 980 990 1000 iOlO 1022 

* « « * « * 

3yniT,.T.2S ATCATGGCCAGCCCCGTGGGCTTCAGCGGCCCCGAGTTCACCTTCCCCCTGTACGGCACC 

bssyn C 

synf ui. nod C 

1030 1040 1050 1060 1070 1080 

» * » ♦ * * 

syn IT . e ATGGGCAACGCCGCCCCCCAGCAGCGCATCGTGGCCCAGCTGGGCCAGGGCGTGTACCGC 

bssyn T. .A. .T A A 

synf ui. nod T. .A. .T A .A 

1090 1100 1110 1120 1130 1140 

★ * * * ♦ * 

synlT . nize ACCCTGAGCAGCACCCTGTACCGCCGCCCCTTCAACATCGGCATCAAC.XACCAGCAGCTG 

bssyn T. . A. .T. 

synful.aod T. .A, 

1150 1160 1170 1180 1190 1200 

* * * * * * 

synlT . aze AGCGTGCTGGACGGCACCGAGTTCGCCTACGGCACCAGCAGCAACCTGCCCAGCGCCGTC 

bssvn 

synful.aod 
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FIG. 3 ICONT) 

ii^J 1220 1230 12*»^ 1250 1260 

synlT . mze TACCGCAAGAGCGGCACCGTGGACAGCCTGGACGAGATCCCCCCCCAGAACAACAACGTG 

bssvn T 

synf ul . mod T 

1270 1280 1290 1300 1310 1320 

* * » . ♦ * • 

synlT . nize CCCCCCCGCCAGGGCTTCAGCCACCGCCTGAGCCACGTGAGCATGTTCCGCAGCGGCTTC 

bssyn , .A, .T. .A T T 

synful .mod ..A.,T..A.. T T 

1330 1340 1350 1360 1370 1380 

synlT . mze AGCAACAGCAGCGTGAGCATCATCCGCGCCCCCATGTTCAGCTGGATCCACCGCAGCGCC 

bssyn T. . A. .T T T. . . 

svnfi:! .mod T..A..T T T-.. 

1390 1400 1410 1420 1430 1440 

synlT . nze GAGTTCAACAACATCATCCCCAGCAGCCAGATCACCCAGATCCCCCTGACCAAGAGCACC 

bssvn 

svnf ul . mod 

1450 1460 1470 1480 1490 150C 

* ♦ * ♦ * « 

synlT . mze AACCTGGGCAGCGGCACCAGCGTGGTGAAGGGCCCCGGCTTCACCGGCGGCGACATCCTG 

bssvn 

synful. mod 

1510 1520 1530 1540 1550 1560 

»♦**** 

bssyn 

^ * * * * * /*4 

* — ^ • » 

1570 1580 1590 1600 1610 162C 

* * •* « « V 

synlT . mze CAGCGCTACCGCG7GCGCATCCGCTACGCCAGCACCACCAACCTGCAGTTCCACACCAGC 

bssyn C 

synful. mod C 

1630 1640 1650 1660 1670 1630 

* * « * « # 

synlT . mze ATCGACGGCCGCCCCATCAACCAGGGCAACTTCAGCGCCACCATGAGCAGCGGCAGCAAC 
bssvn 

synful. mod 

1690 1700 1710 1720 1730 1740 

synlT . mze CTGCAGAGCGGCAGCTTCCGCACCGTGGGCTTCACCACCCCCTTCAACTTCAGC.\ACGGC 
bssyn 

synful. mod 

1750 1760 1770 1780 1790 1800 

synlT . mze AGCAGCGTGTTCACCCTGAGCGCCCACGTGTTCAACAGCGGCAACGAGGTGTACATCGAC 

bssyn 

synful . mod 
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rlG. 3 (uONT) . 

18*w 1820 1830 184w 1350 1360 

* * * ♦ « » 

synlT . mze CGCATCGAGTTCGTGCCCGCCGAGGTGACCTTCGAGGCCGAGTACGACCTGGAGCGCGCC 

bssyn A.G*.T 

svnful .mod : A.G - .T 

' 1870 1880 1890 1900 1910 1920 

« * « ■ * « . » 

5 vnlT . mze CAGAAGGCCGTGAACGAGCTGTTCACCAGCAGC.iiACCAGATCGGCCTGAAGACCGACGTG 

■^Dssyn 

synful.raod 

1930 1940 1950 1960 1970 1980 

* * * * « ♦ 

s vnlT . mze ACCGACTACCACATCGACCAGGTGAGCAACCTGGTGGAGTGCCTGAGCGACGAGTTCTGC 

bssyn T 

svnful . mod T 

1990 2000 2010 2020 2030 2040 

* * * ♦ * ♦ 

svnlT . mze CTGGACGAGAAGAAGGAGCTGAGCGAGAAGGTGAAGCACGCCAAGCGCCTGAGCGACGAG 

bssyn 

5vnful.mod 

2050 2060 20^0 2080 2090 210C 

* ♦ « ♦ » « 

synlT . mze CGC\ACCTGCTGCAGGACCCC.\ACTTCCGCGGCATCAACCGCCAGCTGGACCGCGGCTGG 

bssyn ' 

synf ui . mod 

2110 2120 2130 2140 2150 2160 

* « « « « « 

svnIT . aze CGCGGCAGCACCGACATCACCArCCAGGGCGGCGACGACGTGTTC.\AGGAGAA -TACGTG 
bssvn 

svnful . nod 

2170 2180 2190 2200 2210 2220 

* « « * « ♦ 

synlT . mze ACCCTGCTGGGCACCTTCGACGAGTGCTACCCCACCTACCTG7ACCAGAAGA7CGACGAG 

bssyn •■ — 

synf ul . mod 

2230 2240 2250 2260 2270 2280 

* « « * « * 

synlT . mze AGCAAGCTGAAGGCCTACACCCGCTACCAGCTGCGCGGCTACATCGAGGACAGCCAGGAC 

bssvn r 

synf ul. mod 

2290 2300 2310 2320 2330 2340 

* « « ♦ * * 

synlT . mze CTGGAGATCTACCTGATCCGCTACAACGCCAAGCACGAGACCGTGAACGTGCCCGGCACC 

bssyn 

synf ul . mod 



2350 2360 2370 2380 2390 2400 

♦ ♦ ♦ ♦ ♦ ♦ 

synlT . mze GGCAGCCTGTGGCCCCTGAGCGCCCCCAGCCCCATCGGCAAGTGCGCCCACCACAGCCAC 

bssyn 

svnful . mod 



■ I 
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FIG. 3 (CONT) • 

24xvJ 2420 2430 244o 2450 2460 

* * « » * * 

svnlT . raze CACTTCAGCCTGGACATCGACGTGGGCTGCACCGACCTGAACGAGGACCTGGGCGTGTGG 

bssvn 

synf ul . mod 

2470 2480 2490 2500 2510 2520 

» ♦ « ♦ # t 

S vnlT . mze GTGATCTTCAAGATCAAGACCCAGGACGGCCACGCCCGCCTGGGCAACCTGGAG7TCCTG 

cssyn 

svnf ul .mod 

2530 2540 2550 2560 2570 2580 

«**♦»« 

SvnlT.mze GAGGAGAAGCCCCTGGTGGGCGAGGCCCTGGCCCGCGTGAAGCGCGCCGAGAAGAAGTGG 

fcssyn ~ • 

£vnf ul , mod 

2590 2600 2610 2620 2630 2640 

* « « * * , * 

ivnil . nze CGCGACAAGCGCGAGAAGCTGGAGTGGGAGACCAACATCGTGTACAAGGAGGCCAAGGAG 

cssvn 

ivnf ul . mod 

2550 2670 zeSO 2690 270C 

± * * . •» « < 

= ynl7 . nze AGCGTGGACGCCCTGTTCGTGAACAGCCAGTACGACCGCCTGCAGGCCGACACCAACATC 

^ssyn — 

svnf ul . mod 

2710 2720 2730 2740 2750 2760 

* » « ♦ * » 

S;'rii i ^DZe GCCATGATCCACGCCGCCGACAA3CGCGTGCACAGCAT3CGCGAGGCCTACCTGCCCGAG 

tssvn 

synful . mod 7 

1770 2780 2790 2300 2810 282C 

* * * « # * 

synlT . mze CTGAGCGTGATCCCCGGCGTGAACGCCGCCATCTTCGAGGAGCTGGAGGGCCGCATCTTC 
cssyn 

synf ul .mod , 



2830 2840 2850 2860 2870 2880 

* * « ♦ « t 

synlT . mze ACCGCCTTCAGCCTGTACGACGCCCGCAACGTGATCAAGAACGGCGACTTC\ACAACGGC 
ijssyn 

synf ul . mod 

2890 2900 2910 2920 2930 2940 

* * ♦ ♦ * ♦ 

synlT . mze CTGAGCTGCTGGAACGTGAAGGGCCACGTGGACGTGGAGGAGCAGAACAACCACCGCAGC 
Dssyn 

synf ul. mod 

2950 2960 2970 2980 2990 3000 

* • « ♦ ♦ ♦ 

bssyn 

synf ul .mod 
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flG. 3 luONT) 

30^^ 3020 3030 304 3050 3060 
* , « « ♦ * ♦ 

syniT . nze CGCGGCTACATCCTGCGCGTGACCGCCTACAAGGAGGGCTACGGCGAGGGCTGCGTGACC 
j^ggy^ - 

synf ul . nod • 

3070 30B0 3090 3100 3110 3120 

* * ♦ * * • 
synlT . nze ATCCACGAGATCGAGAACAACACCGACGAGCTGAAG7TCAGCAACTGCGTGGAGGAGGAG 

bssyn 

synf mod ^ 

3130 3140 3150 3160 3170 3180 

* ♦ * * * * 
synlT . r:2e GTGTACCCCAACAACACCGTGACCTGCAACGACTACACCGCCACCCAGGAGGAGTACGAG 

bssyn 

3vr.ful.rnod 

3190 3200 3210 3220 3230 3240 

syr.iT.r.2e GGCACCTACACCAGCCGCAACCGCGGCTACGACGGCGCCTACGAGAGCAACAGCAGCGTG 

•nrf S S t • 

syrf ul . .■nod 

= . ^ 2260 "ZT 3280 3290 3301 

«**»** 
syr.lT . nze CCCGCCGACTACGCCAGCGCCTACGAGGAGAAGGCCTACACCGACGGCCGCCGCGACAAC 

bssvn * 

svnf ul . nod 

3310 3320 3330 3340 3350 3360 

synlT ,-26 CCCTGCGAGAGCAACCGCGGCTACGGCGACTACACCCGCCTGCCCGCCGGCTACGTGACC 
bssvn " 

^ » #**#*••#•♦»•#•••••#•••••••**•••■•'*******"******** * 

3370 3380 3390 3400 3410 342C 

* * ♦ * * « 

syn 1 T . nze AAGGAGCTGGAGTACTTCCCCGAGACCGACAAGGTGTGGATCGAGATCGGCGAGACCGAG 

bssyn 

svnf ul . mod . 

3430 3440 3450 3460 

* ♦ * * 

5 vnlT . nize GGCACCTTCATCGTGGACAGCGTGGAGCTGCTGCTGATGGAGGAGTAG 

bssvn ^ 

svnf ul . mod 



* « * • 
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rIG. 4 



iO 20 30 ^0 50 60 

« ♦ « « « » 

3thkl:rhd atggataacaatccgaacatcaatgaatgcattccttataattgtttaagtaaccctgaa 
pmonbt c c..a c a..c..c..c..g a... 

bssvn C./. ..C..C C..G C..C..C..C..CC.G.-C.....C..G 

70 80 90 100 110 120 

* ♦ « * * ♦ 

3THKURHD GTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCAATCGATATT'TCCTTG 

PMONBT ..T C,T C.C..T C T..C C-.C 

bssyn , .G, .G. .GC.G. .C. .C. .GC.C. .C. .G. .C. ,C C C. CAG . C. . 

130 140 150 160 170 180 

» ♦ ♦ ★ * « 

3THKURHD TCGCTAACGCAATTTCTTTTGAGTGAATTTGT7CCCGGTGCTGGATTTGTGTTAGGACTA 
PMONBT ■ . CT •G- .A« .G« ■ . • . GC .C.-C-.G. .C.^G* ,A««»««»»«G»«C». TC •C..*,,. 
bssyn AGC. .G..C..G-,C.,GC-...C-.G,.C,.G C. .C. .C. .C^ ^ .C.G. .G 

190 200 210 220 230 240 

* » * * » ♦ 

3THKURHD GTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 

PMONBT C.C.C T..C A T C,.G.,G 

bssyn .,G..C..C..C C..C..C..C... AGC ..G C..C..G..G.,G,.C 

250 260 270 280 290 3QC 

* ♦ * » « » 

BTHKURHD GAACAGTTAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCArTTCTAGATTA 

PMONBT ..G G..C G,.G..C G C G C. G-.G 

bssyn . .G. . ,C.G. .C GC.C. .C. .G- .G CC.C G CAGCC.CC.G 

310 320 330 340 350 360 

* ♦ * * « ♦ 

3 iHKwRHZ GAAGGACTA^GCAAT 2rTTA.TCAAATTTACGCAGAATCTTTTAGAGA.>j . L-GwAAGCAGAT 

PMONBT T.G C..C C.T GAGC , . C C. . . 

^ ~ • • (™ i« ^ /~ \ ^* ^ * 

— ..^..^...w ...w v«< • - wtw <j * .w* 

370 380 390 400 410 42: 

« ♦ * « « » 

BTHKURHD CCTACTAATCCAGCATTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTGCC 

PMONBT C TC. CC. C. , G . . A C C, . . 

bssyn ..C,,C..C..C..CC.GC.C..G C..C..G C C,.. 

430 440 450 460 470 480 

* * « V « ft 

BTHKURHD CTTACAACCGCTATTCCTCTTTTTGCAGTTCAAAATTATCAAGTTCCTCTTTTATCAGTA 

PMONBT T.G.,C..A C. . AT. G.-C C . .G , .C. .C Z, .G. .G 

bssyn ..G-.C C,.C,.C..G.,C..C..G..G..C..C..G..G..C..GC. GAGC . . G 

490 500 510 520 530 540 

* * « ♦ * • 

BTHKURHD TATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAA 

PMONBT ..C A..T...C.T..CC. GAGC . - GC. TC . . . . C . . . AGC G . . . 

bssyn . . C . . G . - G . . C . . C. . CC . G . . CC .GAGC . . GC . . C . C . . C . . CAGC C..C..G 

550 560 570 580 590 600 

* « * - * * « 

BTHKURHD AGGTGGGGATTTGATGCCGCGACTATCAATAGTCGTTATAATGATTTAACTAGGCTTATT 

PMONBT C T..A..C C C..C..CC.T G,,. 

bssyn CC C..C..C C..C C..C..C..C..C..CC.G. .CC.C..G..C 
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3THKUSHD 

PMONBT 

cssvn 



6iJ 



620 



630 
* 



54o 
* 



650 



660 
* 



GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGA 

..A C..C..C,.C T..T C.-T..C..G C T 

C. ,C. .C . .C. .C. .G C..C..CC.G C..G T 



STHKLT^D 
^PMONBT 

bssyn 



3THKURHD 

PMONBT 

bssyn 



3THKL11HD 
PMONBT 

::ssyr. 



670 680 690 700 710 720 

* * * . * * * 

CCGGATTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTA 

..T , T C. .C. ,G. .C. .G G,.C,.C..A,.T 

. . C • * CAGCC .C - .C C. .G, .C* .C. .G. . CC • CC . C . . GC «G« .C« .G. .C. .G 



730 



740 750 760 770 780 

« « « ♦ ★ # 

rTAGATATCGTTTCTCTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTT 

..G..C,.T..G C..C CTCC C..C.,T. ...C . . T G 

C.G. .C GAGC. .G..C-.C C..C..CC.C.,C..C..C..C,.C..C..G 



790 300 810 320 830 340 

* « * »' • * « 

TCCCAATTAACAAGAGAAATTTATACAAACCCAGTATTAGAAAATTTTGATGGTAGTTTT 

C.T..C C T TC.7..G..C..C..C C..C 

AG . . . GC , G . . CC . C . . G C. . C C . . GC . G . . G . . C . . C . . C . . C . . C. . C 



3THKl?.HD 
PMONBT 

bssyn 



350 



360 



r c n 



90 



9QC 



:gaggctcggctcagggcatagaaggaagtattaggagtccacatttgatggatatactt 

.T.,T..T..C..A.,T,.C CTCC ..C C C C.CT.G 

. C . . . AGC , .C C..G..C..C..CC.C,.C..C. .CC C.C.G 



PMCNET 



. :3 • 



910 920 930 940 950 960 

★ * * ♦ ★ ♦ 

AACAGTATAACCATCTAT ACGGAIGCTCATAGAG 'ATTATTGGTCAG GGCATCAA 
C T C.GC C G C T..A..C..G 

r..r c..c..c..:..cc.c..c..3..c,.c... agc . . c. . 2. . c 



5THKURHD 
PMONBT 

bssyn 



970 



980 990 LOGO 1010 1020 

* ♦ « * » 



ataatggcttctcctgtagggttttcggggccagaattcacttttccgctatatggaact 

..C C A..T.,A.. CAGC C..G..r..C T-.C 

. . C CAGC . . C . . C . . C . . CAGC . .C. .C - .G C. .C. .C. .G. .C. .C. .C 



3THKURHD 

PMONBT 

Dssvn 



1030 1040 1050 1060 1070 1080 

★ » * * ♦ * 

ATGGGAAATGCAGCTCCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGA 

r» . ^ T 

C..C.,T,.A..T..G..G..C..C..G..A..G..G..C A CC.C 



3THKURHD 

PMONBT 

bssvn 



1090 



1100 



110 



1120 



1130 



1140 



ACATTATCGTCCACTTTATATAGAAGACCTTTTAATATAGGGATAAATAATCAACAACTA 

..C..G..T C-.G..C C..C C.,T..C,.C..C..G T 

■ • CC • GAGC^^G • • • ^r^* • G a • w • TC C • • C « • C • • C • • C • • C • • C • aG* • G 



1150 



1160 



1170 



1130 



1190 



1200 



BTHKURHD 
PMONBT 

bssyn 



TCTGTtCTTGACGGGACAGAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTA 

. .C A G. .C. .C T. .T. ,C T 

AGC . .Ga.G C. .C. .G. .C. .C..C. .C, . .AG. AGC . . CC . . . . CAG ...C.G 
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BTHKURHD 

PMONBT 

bssvn 



FIG. 



4 
*1 



ICOiNT; 



12^J 1220 1230 124w 1250 1260 

♦ « ♦ « » » 

TACAGAAAAAGCGGAACGGTAGATTCGCTGGATGAAATACCGCCACAGAATAACAACGTG 

. , G C . , T CT . . . . C C . . A C T • . . 

...C.C.G C..C..G.. CAGC C . . G , . C. . C . . T C 



3THXURHD 
PMONBT 

bssvn — 



1270 1280 1290 1100 1310 1320 

* ♦ ♦ * « ♦ 

CCACCTAGGCAAGGATTTAGTCATCGATTAAGCCATGTTTCAATGTTTCGTrCAGGCTTT 

C CTCC ..CA.G, .G C..G..C C C..A. .C 

C.A..G..C..C..C.,C..TC.G C. GAGC C . . CAGT C 



BTHKURHD 
PMONBT 

bssvn 



1330 1340 1350 1360 1370 1380 

* ♦ * * ♦ •» 

AGTAATAG7AGTGTAAGTATAATAAGAGCTCCTATG7TCTCTTGGATACATCGTAGTGCT 

. .C. .C. . .TCC. .G..C..C..C A T 

..C..C..C. .C.,G..C..C..CC.r..A AGC T..C.,C C 



BTHKURHD 

PMONBT 

bssvn 



1390 1400 1410 1420 1430 

» « * * « 

GAATTTAATAATATAATTCCTTCATCA — CAAATTACACAAATACCTTT.^ACAAAATCTA 

• <sj. .w« • • • •X- 'o* . < • • .W. vL.* • • • «W« aO* .W. . • • 

. .G. .C. .C. .C..C. .C. . — . .G. .GC. .G. ,C...C. .G. .C. .CC.G. .C. . GAGC . 



37HKURHD 
PMCNBT 

bssvn 



• *T S - 



450 



147C 



148C 



1490 



CTAATCTTGGCTCTGGAACTTCTGTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTC 

. . . ,C A G C. .C 7 

. C . . C . . G - . . AGC . . C . . CAGC ..G. .G..G-.C.,C..C..C..C..C..C..C..C. 



37r:KURH 
FM0NB7 



• ^ 

1 3oC 



1500 1510 1520 1530 1540 

« ★ * * * * 

7 * wGAAG AAC77CACC7GGCCAG A777CAACC * 7AAGAG7AAA7A 1 7A,s- . GC ACCAT7A7 



AG^ 



C. .C 



.C.T. 



37HKURHD 
PH0NB7 

bssyn 



560 1570 1580 1590 1600 16i: 

♦ * » « « T 

CACAAAGA7A7CGGG7AAGAA77CGC7ACGC77C7ACCAC.\AA777AC.\A77CCATACA7 

.7 T..C..G 7 A 7..C..G C..C. 

GC. .GC.C. .C. .C. .CC.C, .C CAGC C. .CC.G. .G C . . CA 



B7HKURHD 
PMONBT 

bssyn 
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64 . ATGGACCTGC TGCCCGACGC CCGCATCGAG GACAGCCTGT GCATCGCCGA GGGCAACAAC 
MetAspLeu LeuProAsp AlaArglleGlu AspSerLeu CysIleAla GluGiyAsnAsn 

124 ATCGACCCCT TCG7GAGGGC CAGCACCGTG CAGACCGGCA 7CAACATCGC CGGCCGCATC 
IlaAspPro PheValSer AlaSerThrVal GlnThrGly IleAsnlle AlaGlyArglle 

184 CTGGGCGTGC TGGGCGTGCC CTTCGCCGGC CAGCTGGCCA GCTTCTACAG CTTCCTGGTG 

LeuGlyVal LpeuGlyVal ProPheAlaGly GlnLeuAla SerPheTyr SerPheJLeuVai 

24 4 GGCGAGCTGT GGCCCCGCGG CCGCGACCAG TGGGAGATCt TCCTGGAGCA CGTGGAGCaG 
GlyGluLeu TrpProArg GlyArgAspGln TrpGluIle PheLeuGlu HisValGluGln 

304 CTGATCAACC AGCAGATCAC CGAGAACGCC C5CAACACCG CCCTGGCCCG CCTGCAGGGC 
LeuIleAsn GlnGlnlle ThrGlxiAsnAla ArgAsnThr AlaLeuAla ArgLeuGlnGly 

3 64 CTGGGCGACA GCTTCCGCGC CTACCAGCAG AGCCTGGAGG ACTGGCTGGA GAACCGCGAC 
LeuGlyAsp SerPheArg AlaTyrGlnGln SerLeuGlu AspTrpLeu GluAsnArgAsp 

424 GACGCCCGCA CCCGCAGCGT GCTGTACACC CAGTACATCG CCCTGGAGCT GGACTTCCTG 

AspAlaArg ThrArgSer ValLeuTyrThr GinTyrlle AlaLeuGlu LeuAspPtieLeu 

484 AACGCCATGC CCCTGTTCGC CATCCGCAAC CAGGAGGTGC CCCTGCTGAT GGTGTACGCC 
AsnAlaMec ProLeiiPhe AlalleArgAsn GliiGluVal ProLeuI«eu MetValTyrAla 

544 CAGGCCGCCA ACCTGCACCT GCTGCTGCTG CGCGACGCCA GCCTGTTCGG CAGCGAGTTC 
GlnAlaAIa Asnl^uHis Leui-euLeuLeu ArgAspAla SerL-euPhe GlySerGluPhe 

604 GGCCTGACCA GCCAGGAGAT CCAGCGCTAC TACGAGCGCC AGGTGGAGCG CACCCGCGAC 
GlyLeuThr SerGlnGlu IleGlnArgTyr TyrGluArg GlnValGlu ArgTlirArgAsp 

664 TACAGCGACT ACTGCGTGGA GTGG7ACAAC ACCGGCCTGA ACAGCCTGCG CGGCACCAAC 
TyrSerAsp TyrCysVal GluTrpTyrAsn ThrGlyLeu AsnSerL-eu ArgGlyThrAsn 

724 GCCGCCAGC7 GGGTGCGCTA CAACCAGTTC CGCCGCGACC TGACCCTGGG CGTGCTGGAC 
AlaAlaSer TrpValArg TyrAsnGlnPhe ArgArgAsp L^uThrLeu GlyValLeuAsp 

784 CTGG7GGCCC 7G77CCCCAG C7ACGACACC CGCACC7ACC CCA7CAACAC CAGCGCCCAG 
LeuValAIa LeuPhePro Ser7yrAsp7hr Arg7hr7yr ProIleAsn 7hrSerALaGLa 

844 C7GACCCGCG AGG7G7ACAC CGACGCCA7C GGCGCCACCG GCG7GAACA7 GGCCAGCATG 
LeuThrArg GluValTyr ThxAspAlalle GlyAla7hr GlyValAsn tSetAlaSerMet 

904 AAC7GG7ACA ACAACAACGC CCCCAGC77C AGCGCCA7CG AGGCCGCCGC CATCCGCAGC 
AsnTrp7yr AsnAsnAsn AlaProSerPhe SerAlalle GluAlaAla AlalleArgSer 

964 CCCCACC7GC 7GGAC77CC7 GGAGCAGC7G ACC:A7C77CA GCGCCAGCAG CCGC7GGAGC 
ProHisI-eu LeiiAspPhe LeuGluGlnLeu 71irIlePhe SerAlaSer SerArgTrpSer 

024 AACACCCGCC AC:A7GACC7A C7GGCGCGGC CACACCA7CC AGAGCCGCCC CA7CGGCGGC 
AsnThrArg HisMet71ir 7yr7rpArgGly His7hrlle GinSerAzrg ProIleGiyGly 
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1084 GGCCTGAACA CCAGCACCCA CGGCGCCACC AACACCAGCA TCAACCCCGT GACCCTGCGC 
GlyLeuAsn ThrSerThr HisGlyAlaTlir AsnThrSer IleAsnPro VaiThrLeuArg 

1144 TTCGCCIAGCC GCGACGTGTA CCGCACCGAG AGCTACGCCG GCGTGCTGCT G7GGGGCATC 
PheAlaSer ArgAspVal TyrArgThrGlu SerTyrAla GlyValLeu LeuTrpGlylle 

1204 TACCTGGAGC CCATCCACGG CGTGCCCACC GTGCGCTTCA ACTTCACCAA CCCCCAGAAC 
TyrLeuGlu ProIleHis GlyValProThr ValArgPhe AsnPheTtir AsnProGlnAsn 

1264 ATCAGCGACC GCGGCACCGC CAACTACAGC CAGCCCTACG AGAGCCCCGG CCTGCAGCTG 
IleSerAsp ArgGlyThr AlaAsnTyrSer GlnProTyr GluSerPro GlyLeuGlnLeu 

1324 AAGGACAGCG AGACCGAGCT GCCCCCCGAG ACCACCGAGC GCCCCAACTA CGAGAGCTAC 
LysAspSer GluThrGlu LexiProProGlu TlirThrGlu ArgProAsn TyrGluSerTyr 

1384 AGCCACCGCC TGAGCCACAT CGGCATCATC CTGCAGAGCC GCGTGAACGT GCCCGTGTAC 
SerHisArg LeuSerHis IleGlyllelle LeuGInSer ArgVaLAsn VaiProVaiTyr 

444 AGCTGGACCC ACCGCAGCGC CGACCGCACC AAC\CCATCG GCCCCAACCG CATCACCCAG 
SerTrpThr HisArgSer AlaAspArgThr AsnTlirlle GlyProAsn ArglleThrGia 

1504 ATCCCCATGG TGAAGGCCAG CGAGCTGCCC CAGGGCACCA CCGTGGTGCG CGGCCCCGGC 
IleProMet ValLysAla SerGluLeuPro GlnGlyThr ThrValVal ArgGlyProGly 

1564 TTCACCGGCG GCGACATCCT GCGCCGCACC AACACCGGCG GCTTCGGCCC CATCCGCGTG 
PheThrGiy GlyAspIle LeuArgArgThr AsnThrGly GlyPheGly ProIieArgVai 

1624 ACCGTGAACG GCCCCCTGAC CCAGCGCTAC CGCATCGGCT TCCGCTACGC CAGCACCGTG 
ThrValAsn GlyProLeu ThrGlnArgTyr Argil eGly PheArgTyr AlaSerThrVal 

1684 GACTTCGACT TC7TCGTGAG CCGCGGCGGC ACCACCGTGA ACAACTTCCG CTTCCTGCGC 
AspPheAsp PhePheVal SerArgGlyGly ThrThrVal AsnAsnPhe ArgPheLeuArg 

174 4 ACCATGAACA GCGGCGACGA GCTGAAG7AC GGCAACTTCG TGCGCCGCGC CTTCACCACC 
ThrMetAsn SerGlyAsp GluLeuLysTyr GlyAsnPhe ValArgArg AlaPheThrThr 

1804 CCCTTCACCT TCACCCAGAT CCAGGACATC ATCCGCACCA GCATCCAGGG CCTGAGCGGC 
ProPheThr PheThrGln IleGlnAspIle IleArgThr SerlleGln GlyLeuSerGly 

1864 AACGGCGAGG TGTACATCGA CAAGATCGAG ATCATCCCCG TGACCGCCAC CTTCGAGGCC 
AsnGlyGlu ValTyrlle AspLysIleGlu IlellePro ValThrAla ThrPheGluAla 

1924 GAG7ACGACC TGGAGCGCGC CCAGGAGGCC GTGAACGCCC TGTTCACCAA CACCAACOX: 
GluTyrAsp LeuGluArg AlaGlnGluAla ValAsnAla LeuPheThr AsnThrAsnPro 

1984 CGCCGCCTGA AGACCGACGT GACCGACTAC CACATCGACC AGG7GAGCAA CC7GG7GGCC 
AxgArgLeu LysThrAsp ValThrAspTyr HisIleAsp GlnValSer AsnLeuValAla 

204 4 7GCC7GAGCG ACGAG77C7G CCTGGACGAG AAGCGCGAGC 7GC7GGAGAA GG7GAAG7AC 
CysLeuSer AspGluPhe CysLeuAspGlu LysArgGlu LeiiLeuGlu LysValLys7yr 
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2104 GCC^GCGCZ TGAGCGACSA GCGCAACCTG CTGCAGGACC CCAACTTCAC CAGCATCAAC 
AlaLysArg LeuSerAsp GluArgAsnLeu I-euGlnAsp ProAsnPhe ThrSerlleAsn 

2164 AAGCAGCCC3 ACTTCATCAG CACCAACGAG CAGAGCAACT TCACCAGCAT CCACGAGCAG 
LysGlnPro AspPhelle SerThrAsnGlu GlnSerAsn PheTlirSer IleHisGluGln 

2224 AGCGAGCACG GCTGGTGGGG CAGCGAGAAC ATCACCATCC AGGAGGGCAA CGACGTGTTC 
SerGluHis GlyTrpTrp GlySerGluAsn IleThrlle GlnGluGly AsnAspVaiPhe 

2284 AAGGAGAACT ACGTGACCC? GCCCGGCACC TTCAACGAGT GCTACCCCAC CTACCTGTAC 
LysGluAsn TyrValThr LeuProGlyThr PheAsnGlu CysTyrPro ThrrTyrLeuTyr 



2344 CAGAAGATCG GCGAGAGCGA GCTGAAGGCC TACACCCGCT ACCAGCTGCG CGGCTACATC 
GlnLysIle GlyGluSer GluI»euJLysAla TyrThrArg TyrGlnlreu ArgGlyTyrlle 

2404 GAGGACAGCC AGGACCTGGA GATCTACCTG ATCCGCTACA ACGCCAAGCA CGAGACCCTG 
GluAspSer GlnAspLeu GluIleTyrLeu IleArgTyr AsnAlaLys HisGluThrLeu 

24 64 GACGTGCCCG GCACCGAGAG CCTGTGGCCC CTGAGCGTGG AGAGCCCCAT CGGCCGCTGC 
AspValPro GlyThrGlu SerLeuTrpPro LeuSerVai GluSerPro IleGlyArgCys 

2S24 GGCGAGCCCA ACCGCTGCGC CCCCCACTTC GAGTGGAACC CCGACCTGGA CTGCAGCTGC 
GlyGluPro AsnArgCys AlaProHisPhe GluTrpAsn ProAspLeu AspCysSerCys 

2584 CGCGACGGCG AGAAG7GCGC CCACCACAGC CACCACTTCA GCCTGGACAT CGACGTGGGC 
ArgAspGly GluLysCys AlaHisHisSer HisHisPhe SerLeuAsp IleAspValGly 

2 64 4 TGCACCGACC TGCACGAGAA CCTGGGCGTG TGGGTGGTGT TCAAGATCAA GACCCAGGAG 
CysThxAsp LeuHisGlu AsnLeuGlyVal TrpValVal PheLysIle LysThrGlnGlu 

2704 GGCCXCGCCZ GCCTGGGCAA CCTGGAGTTC ATCGAGGAGA AGCCCC7GCT GGGCGAGGCC 
GlyHisAla ArgLeuGly AsnLeuGliiPhe IleGluGlu LysProLeu LeuGlyGluAla 



2764 CTGAGCCGCG TGAAGCGCGC CGAGAAGAAG TGGCGCGACA AGCGCGAGAA GCTGCAGCTG 
LeuSerArg ValLysArg AlaGluLysLys TrpArgAsp LysArgGlu LysLeuGlnLeu 

2824 GAGACCAAGC GCGTGTACAC CGAGGCCAAG GAGGCCGTGG ACGCCC7GTT CGTGGACAGC 
GluThrLys ArgValTyr ThxGluAlaLys GluAlaVai AspAiaLeu PheValAspSer 

2884 CAGTACGACC GCCTGC\GGC CGACACCAAC ATCGGCATGA TCdACGCCGC CGACAAGCTG 
GlnTyrAsp ArgLeuGln AlaAspThrAsn IleGlyf^et liefiisAla AlaAspLyaLeu 

2944 G7GCACCGCA 7CCGCGAGGC C7ACC7GAGC GAGC7GCCCG 7GA7CCCCGG CGTGAACGCC 
VaiHisArg IleArgGlu Ala7yrLeuSer GluI>eiiPro ValliePro GlyVaiAsnAla 

3004 GAGA7C7TCG AGGAGC7GGA GGGCCACA7C A7CACCGCCA TCAGCC7G7A CGACGCCCGC 
GluIlePhe GluGluLeu GluGlyHisIle Ile7hxAla IleSerLeu 7yrAspAlaArg 
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3064 AACGTGGTGA AGAACGGCGA CTTCAACAAC 
AsnValVal LysAsnGly AspPheAsiiAsn 

3124 GTGGACGTGC AGCAGAGCCA CCACCGCAGC 
ValAspVal GlnGinSer HisHisArgSer 

3184 G7GAGCCAGG CCGTGCGCGT GTGCCCCGGC 
ValSerGlii AlaValArg VaiCysProGly 

3244 AAGGAGGGCT ACGGCGAGGG CTGCGTGACC 
LysGluGly TyrGlyGlu GlyCysValTlir 

3304 CTGAAG7TCA AGAACCGCGA GGAGGAGGAG 
LeuLysPhe LysAsnArg GluGluGluGlu 

3364 GACTACACCG CCCACCAGGG C^CCGCZGGC 
AspTyrThr AlaHisGln GlyThrAlaGly 

34 24 GGCTACGAGG ACGCCTACGA GGTGGACACC 
GlyTyrGlu AspAlaTyr GluValAspThr 

34 84 GAGGAGGAGA CCTACACCGA CGTGCGCCGC 
GluGluGlu ThxTyrTlir AspValArgArg 

354 4 GTGAACTACC CCCCCGTGCC CGCCGGCTAC 
ValAsnTyr ProProVal ProAlaGlyTyr 

3604 ACCGACACCG 7G7GGATCGA GATCGGCGAG 
7hrAsp7hr Val7rprie GluIleGlyGlu 

3664 GAGC7GC7GC 7GATGGAGGA G7AG 
GluLeuLeu LeuMetGlu Glu 



GGCC7GACCT GCTGGAACG7 GAAGGGCCAC 
GlyLeuThr CysTrpAsn ValLysGlyHis 

GACC7GG7GA TCCCCGAG7G GGAGGCCGAG 
AspLeuVal IleProGlu 7rpGluAlaGlu 

7GCGGC7ACA 7CCTGCGCG7 GACCGCCTAC 
CysGlyTyr IleLeuArg Val7tirAlaTyr 

A7CCACGAGA 7CGAGAACAA CACCGACGAG 
IleHisGlu IleGluAsn Asn71irAspGlu 

G7G7ACCCCA CCGACACCGG CACC7GCAAC 
Val7yrPro 7hrAspThr Gly7hrCysAsa 

7GCGCCGACG CC7GCAACAG CCGCAACGCC 
CysAlaAsp AlaCysAsn SerArgAsnAla 

ACCGCCAGCG 7.GAACTACAA GCCGACCTAC 
7hrAlaSer ValAsnTyr LysProThrTyr 

GACAACCACT GCGAGTACGA CCGCGGCTAC 
AspAsnHis CysGluTyr AspArgGlyTyr 

G7GACCAAGG AGCTGGAGTA C77CCCCGAG 
Val7hrLys GluLeuGiu 7yrPheProGlu 

ACCGAGGGCA AGTTCATCG7 GGACAGCG7G 
7hrGluGly LysPhella ValAspSerVai 
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64 ATGGACCTGC TGCCCGACGC CCGCATCSAG GACAGCCTGT GCATCGCCGA GGGCAACAAC 
MecAspLeu LeuProAsp AlaArglleGlu AspSerLeu CysIleAla GluGiyAsnAsn 

124 ATCGAGCCCT TCGTGAGCGC CAGCACCGTG CAGACCGGCA TCAACATCGC CGGCCGCATC 
IleAspPro PheValSer AlaSerThrVal GlnThrGly IleAsnlle AlaGlyArglle 

184 CTGGGCGTGC TGGGCGTGCC CTTCGCCGGC CAGCTGGCCA GCTTCTACAG CTTCCTGGTG 
LeuGlyVal LeuGlyVal ProPheAlaGly GlnLeuAla SerPheTyr SerPheLeuVal 

244 GGCGAGCTGT GGCCCCGCGG CCGCGACCAG TGGGAGATC7 TCCTGGAGCA CGTGGAGCAG 
GlyGluLeu TrpProArg GlyArgAspGin TrpGluIle PheLeuGlu HisValGluGln 

304 CTGATCAACC AGCAGATCAC CGAGAACGCC CGCAACACCG CCCTGGCCCG CCTGCAGGGC 
LeuIleAsn GlnGlnlle ThrGluAsnAla ArgAsnThr AlaLeuAla ArgLeuGlnGly 

364 CTGGGCGACA GCTTCCGCGC CTACCAGCAG AGCCTGGAGG ACTGGCTGGA GAACCGCGAC 
LeuGlyAsp SerPheArg AlaTyrGlnGln SerLeuGlu AspTrpLeu GluAsnArgAsp 

424 GACGCCCGCA CCCGCAGCG7 GCTGTACACC CAGTACATCG CCCTGGAGCT GGACTTCCTG 

AspAlaArg ThrArgSer ValLeuTyrThr GlnTyrlle AlaLeuGlu LeuAspPheLeu 

4 84 AACGCCATGC CCCTGTTCGC CATCCGCAAC CAGGAGG7GC CCCTGCTGAT GGTGTACGCC 
AsnAlaMet ProLeuPhe AlalleArgAsn GlnGluVal ProLeuLeu MetValTyrAla 

544 CAGGCZGCCA ACCTGCACC7 GCTGCTGC7G CGCGACGCCX GCC7G77CGG CAGCGAG77C 
GlnAlaAIa AsnLeuHis LeuLeui^uLeu ArgAspAIa SerLeuPhe GlySerGluPbe 

604 GGCC7GACCA GCCAGGAGA7 CC:AGCGC7AC 7ACGAGCGCC AGG7GGAGCG CACCCGCGAC 
GlyLeu7hr SerGlnGlu IleGlnAx-gTyr 7yrGliiArg GlnValGlu ArgTlirArgAsp 

S64 TACAGCGACT AC7GCG7GGA G7GGTAC:AAC ACCGGCC7GA ACAGCC7GCG CGGCACCAAC 
7yrSerAsp 7yrCysVal GluTrpTyrAsn 7hrGlyLeu AsnSerLeu ArgGlyThrAsn 

724 GCCGCCAGC7 GG67GCGC7A CAACCAG77C CGCCGCGACC 7GACCC7GGG CG7GC7GGAC 
AlaAlaSer 7rpValArg 7yrAsnGlnPhe ArgArgAsp Leu71irLeu GlyValLeuAsp 

784 C7GG7GGCCC 7G77CCCCAG C7ACGACACC CGCACC7ACC CCA7CAACAC CAGCGCCCAG 
LeuVaLAla LeuPhePro Ser7yrAspThr AxgThrTyr ProIleAsn 7hrSerAlaGln 

844 C7GACCCGCG AGG7G7ACAC CGACGCCA7C GGCGCCACCG GCG7GAACA7 GGCCAGCATG 
LeuThrArg GluValTyr ThrAspAlalle GlyAiaThx GlyVaiAsn MetAiaSerttet 

904 AAC7GG7ACA ACAACAACGC CCCCAGC7TC AGCGCCA7CG AGGCCGCCGC C:A7CCGCAGC 
AsnTrpTyr AsnAsnAsn AlaProSerPhe SerAlalle GluAlaAla AlalleArgSer 

964 CCCCACCTGC 7GGAC7TCG7 GGAGCAGC7G ACC:A7C77CA GCGCCAGCAG CCGC7GGAGC 
ProHisLeu LeuAspPhe LeuGluGlnLeu TlirllePhe SerAlaSer SerArgTrpSer 

1024 AACACCCGCC ACA7GACC7A C7GGCGCGGC CACACCA7CC AGAGCCGCCC CA7CGGCGGC 
AsnThrArg HisMetThr TyrTrpAxgGly His7iirlle GInSerArg ProIleGlyGly 
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FIG. 6 (C2NT) 

T 
1 

1084 GGCCTGAACA CCAGCACCCA CGGCGCCACC AACACCAGCA TCAACCCCST GACCCTGCGC 
GlyLeuAsn ThrSerTtxr HisGlyAlaThr A^nTiirSer IleAsnPro valTtirLeuArg 

1144 TTCGCCAGCC GCGACGTGTA CCGCACCGAG AGC7ACGCCG GCGTGCTGCT G7GGGGCATC 
PheAiaSer ArgAspVal TyrArgThrGlu SerTyrAla GlyValLeu LeuTrpGlylle 

1204 TACCTGGAGC CCATCCACGG CGTGCCCACC GTGCGCTTCA ACTTCACCAA CCCCCAGAAC 
TyrLeuGlu Prolleflis GlyVaiProThr ValArgPhe AsnPheThr AsnProGlnAsa 

1264 ATCAGCGACC GCGGC\CCGC CAACTACAGC CAGCCCTACG AGAGCCCCGG CCTGCAGCTG 
IleSerAsp ArgGlyThr AlaAsnTyrSer GlnProTyr GluSerPro GlyLeuGlnLeu 

1324 AAGGACAGCG AGACCGAGCT GCCCCCCGAG ACCACCGAGC GCCCCAACTA CGAGAGCTAC 
LysAspSer GluThrGlu LeuProProGlu ThrThrGlu ArgProAsn TyrGluSerTyr 

1384 AGCCACCGCC TGAGCCACAT CGGCATCATC CTGCAGAGCC GCG7GAACGT GCCCGTGTAC 
SexHisAxg LeuSerflis IleGlyllelle LeuGlnSer ArgValAsn ValProValTyr 

1444 AGCTGGACCC ACCGCAGCGC CGACCGCACC AACACCATCG GCCCCAACCG CATCACCCAG 
SerTrpThr HisArgSer AlaAspArgThr AsnTIirlle GlyProAsn ArglleThrGln 

1504 ATCCCCATGG TGAAGGCCAG CGAGCTGCCC CAGGGCACCA CCGTGGTGCG CGGCCCCGGC 
IleProMet ValLysAla SerGluLeuPro GlnGlyThr ThrValVal ArgGlyProGly 

1564 TTCACCGGCG GCGACATCCT GCGCCGCACC AACACCGGCG GCTTCGGCCC CATCCGCGTG 
PheThrGly GlyAspIle LeuAxgArgThr AsnThrGly GlyPheGly ProIleArgVal 

1S24 ACCGTGAACG GCCCCCTGAC GCAGCGCTAC CGCATCGGCT TCCGCTACGC CAGCACCGTG 
ThrValAsn GlyProI*eu ThrGlnArgTyr ArglleGly PheArgTyr AlaSerThrVal 

1684 GACTTCGACT TCTTCGTGAG CCGCGGCGGC ACCACCGTGA ACAACTTCCG CTTCCTGCGC 

AspPheAsp PhePheVal SerArgGlyGly ThrThrVal AsnAsnPhe ArgPheLeuArg 

1744 ACCATGAACA GCGGCGACGA GCTGAAGTAC GGCAACTTCG TGCGCCGCGC CT7CACCACC 
ThxMetAsn SerGlyAsp GluLeuLysTyr GlyAsnPhe ValArgArg AlaPheThrThr 

1804 CCCTTCACC7 7CACCCAGA7 CCAGGACA7C A7CCGCACCA GCA7CCAGGG CC7GAGCGGC 
ProPhe7lir PheThrGln IleGlnAspIle IleArgTlir SerlleGln GlyLeuSerGly 

■ 1864 AACGGCGAGG TGTACATCGA CAAGA7CGAG A7CA7CCCCG 7GACCGCCAC C77CGAfiGCC 
AsnGlyGlu ValTyrlle AspLysIleGlu IlellePro ValThrAla ThrPheGluAla 

1924 GAG7ACGACC TGGAGCGCGC CCAGGAGGCC G7GAACGCCC 7GT7CACCAA CACCAACCCC 
Glu7yrAsp LeuGluArg AlaGlnGluAia ValAsnAla LeuPheThr AsnThrAsnPro 

1984 CGCCGCCTGA AGACCGACGT GACCGACTAC CACATCGACC AGGTGAGCAA CCTGGTGGCC 
ArgAxgLeu LysTlirAsp ValThrAspTyr HisIleAsp GlnValSer AsnLeuValAla 

^20AA TGCCTGAGCG ACGAGTTCTG CCTGGACGAG AAGCGCGAGC TGCTGGAGAA GGTGAAGTAC 
CysLeuSer AspGluPhe CysLeuASF>Glu LysArgGlu LeoLeuGlu LysValLysTyr 
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2104 GCCAAGCGCC TGAGCGACGA GCGCAACCTG CTGCAGGACC CCAACTTCAC CAGCATCAAC 
AlaLysArg LeuSerAsp GluAxgAsnLeu LeuGlnAsp ProAsnPhe ThrSerlleAsn 

2164 AAGCAGCCCG ACTTCATCAG CACCAACGAG CAGAGCAAC7 TCACCAGCAT CCACGAGCAC 
LysGlnPro AspPhelle SerThrAsnGlu GlnSerAsn PheThrSer IlaHisGluGln 

2224 AGCGAGCACG GCTGGTGGGG CAGCGAGAAC ATCACCATCC AGGAGGGCAA CGACGTGTTC 
SerGluHis GlyTrpTrp GlySerGluAsn IleThrlle GlnGluGly AsnAspValPhe 

2284 AAGGAGAACT ACGTGACCCT GCCCGGCACC TTCAACGAGT GCTACCCCAC CTACC7GTAC 
LysGluAsn TyrValThr LeuProGlyThr PheAsnGlu CysTyrPro ThrTyrLeuTyr 

2344 CAGAAGATCG GCGAGAGCGA GCTGAAGGCC TACACCCGCT ACCAGCTGCG CGGCTACATC 
GinLysIle GlyGluSer GluLeuLysAIa TyrThrArg TyrGinLeu ArgGlyTyrlle 

2404 GAGGACAGCC AGGACCTGGA GATC7ACCTG ATCCGCTACA ACGCCAAGCA CGAGACCCTG 
GluAspSer GlnAspLeu GluIleTyrLeu IleArgTyr AsnAlaLys HisGluThrLeu 

24 64 GACGTGCCCG GCACCCSAGAG CCTGTGGCCC CTGAGCGTGG AGAGCCCCAT CGGCCGCTGC 
AspValPro GlyThrGlu SerLeuTrpPro LeuSerVal GluSerPro IleGlyArgCys 

2524 GGCGAGCCCA ACCGCTGCGC CCCCCACTTC GAGTGGAACC CCGACCTGGA CTGCAGCTGC 
GlyGluPro AsnArgCys AlaProfiisPhe GluTrpAsn ProAspLeu AspCysSerCys 

2584 CGCGACGGCG AGAAGTGCGC CCACCACAGC CACCACTTCA GCCTGGACAT CGACGTGGGC 
AxgAspGly GluLysCys AlaHisHisSer HisHisPhe SerLeuAsp IleAspValGly 

2 644 TGCACCGACC TGCACGAGAA CCTGGGCGTG TGGGTGGTG7 TCAAGATCAA GACCGAGGAG 
CysThrAsp LeuflisGlu AsnI*euGlyVal TrpValVal PheLysIle LysThrGlnGlu 

2704 GGCCACGCCC GGCTGGGCAA CCTGGAGTTC A7CGAGGAGA AGCCCCTGC7 GGGCGAGGCC 
GlyHisAla ArgLeuGly AsnLeuGluPhe IleGluGlu LysProLeu LeuGlyGluAla 

2764 C7GAGCCGCG 7GAAGCGCGC CGAGAAGAAG 7GGCGCGACA AGCGCGAGAA GC7GCAGCTG 
LeuSerArg ValLysArg AlaGluLysLys 7ipArgAsp LysArgGlu LysLeuGlnLeu 

2824 GAGACCAAGC GCGTG7ACAC CGAGGCCAAG GAGGCCG7GG ACGCCC7G77 CG7GGACAGC 
Glu7hrLys ArgVal7yr 7hrGluAlaLys GluAiaVal AspAlaLeu PheValAspSer 

2884 CAG7ACGACC GCC7GCAGGC CGACACCAAC A7CGGCA7GA 7CCACGCCGC CGACAAGCTG 
Gin7yrAsp ArgLeuGln AlaAspThrAsn IleGlyMet IlefliaAia AiaAspLysLeu 

2944 G7GCACCGCA 7CCGCGAGGC C7ACCTGAGC GAGC7GCCCG 7GA7CCCCGG CG7GAACGCC 
ValHisArg IleArgGlu AlaTyrL-euSer GluLeuPro ValllePro GlyValAsnATa 

3004 GAGA7C77CG AGGAGC7GGA GGGCCAC:A7C ATCACCGCCA TCAGCG7G7A CGACGCCCGC 
GiuIlePhe GluGlxiLeu GluGlyflisIle IleThrAla IleSerJUau TyrAspAlaArg 
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3064 AACGTGGTGA AGAACGGCGA CTTCAACAAC GGCCTGACCT GCTGGAACGT GAAGGGCCAC 
AsnValVal LysAsnGly AspPheAsnAsn GlyLeuTlir CysTrpAsn VaiLysGlyHis 

3124 GTGGACGTGC AGCAGAGCCA CCACCGCAGC GACCTGGTGA TCCCCGAGTG GGAGGCCGAG 
ValAspVal GlnGlnSer HisflisArgSer AspLeuVal IleProGlu TrpGluAiaGlu 

3184 GTGAGCCAGG CCGTGCGCGT GTGCCCCGGC TGCGGCTACA TCCTGCGCGT GACCGCCTAC 
ValSerGln AlaValArg ValCysProGly CysGiyTyr Ilel-euArg ValTlirAlaTyr 

3244 AAGGAGGGCT ACGGCGAGGG CTGCGTGACC ATCCACGAGA TCGAGAACAA CACCGACGAG 
LysGluGly TyrGlyGlu GlyCysValThr IleHisGlu IleGluAsn AsnTlirAspGlu 

3304 CTGAAGTTCA AGAACCGCGA GGAGGAGGAG GTGTACCCCA CCGACACCGG CACC7GCAAC 
IieuLysPhe LysAsnArg GluGluGluGlu ValTyrPro ThrAspThr GI yThrCysAsn 

33 64 GACTACACCG CCCACCAGGG CACCGCCGGC TGCGCCGACG CCTGCAACAG CCGCAACGCC 

AspTyrThr AlaHisGln GlyThrAlaGly CysAlaAsp AlaCysAsn SerArgAsnAla 

3424 GGCTACGAGG ACGCCTACGA GGTGGACACC ACCGCCAGCG TGAACTACAA GCCCACCTAC 
GlyTyrGlu AspAlaTyr GluVaLAspThr ThrAlaSer ValAsnTyr LysProThrTyr 

34 84 GAGGAGGAGA CCTACACCGA CGTGCGCCGC GACAACCIACT GCGAG7ACGA CCGCGGCTAC 

GluGluGlu ThrTyrThr AspValArgArg AspAsnHis CysGluTyr AspArgGlyTyr 

3544 GTGAACTACC CCCCCG7GCC CGCCGGCTAC GTGACCAAGG AGCTGGAGTA CTTCCCCGAG 
ValAsnTyr ProProVal ProAIaGlyTyr VaiThrLys GluLeuGlu TyrPheProGlu 

3604 ACCGACACCG TGTGGATCGA GATCGGCGAG ACCGAGGGCA AGTTCATCGT GGACAGCGTG 
TlirAspThr ValTrpIle GluIleGlyGlu ThrGluGiy LysPhelle ValAspSerVal 

3664 GAGC7GCTGC TGATGGAGGA GTAG 
GluLeuLeu LexiMetGlu Glu 
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FIG. 7 

SSQUENCE OF THE FULL-LENGTH HYBRID SYNTHETIC/NATIVE CRYLA{B) CHIMERIC GENE 
The fusion point between the synthetic and native coding sequences is 
indicated by a slash (/) in the sequence, 

1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAfi 
MetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGC7 GAGCGAGTTC G7GCCCGGCG CCGGCTTCGT GC7GGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGlyPbe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAG7GGG ACGCCTTCCT GG7GCAGATC 

VaiAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu IleAsnGln ArglleGluGlu PheAiaArg AsnGlnAla IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr AlaGluSer PheArgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAX GAACAGCGCC 
ProThrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp HetAsnSerAla 

421 CTGACCACCG CCATCCCCCT GTTCGCCG7G CAGAAC7ACC AGG7GCCCCT GCTGAGCGTG 
LeuThrThr AlallePro LeuPheAlaVal GlnAsnTyr GlnValPro LeuLeuSerVal 

4 81 TACG7GCAGG CCGCCAACC7 GCACC7GAGC G7GC7GCGCG ACG7CAGCGT G7TCGGCCAG 
TyrValGln AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer ValPheGlyGLn 

541 CGC7GGGGC7 TCGACGCCGC CACCATCAAC AGCCGC7ACA ACGACCTGAC CCGCCTGATC 
AxgTrpGly PheAspAla AlaThrlleAsn SerArgTyr AsnAspLau 7hrArgLeuIle 

601 GGCAAC7ACA CCG2>iCCXCGC CG7GCGCTGG TACAACACCG GCCTGGAGCG CGTG7GGGGT 
GlyAsnTyr ThrAspHis AlaVaLArgTrp TyrAsnThr GlyLeuGlu ArgValTrpGly 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC CAG77CCGCC GCGAGCTGAC CC7GACCGTG 
ProAspSer ArgAspTrp IleArgTyrAsn GlnPheArg ArgGluLeu ThrLeuThrVal 

' 721 C7GGACA7CG TGAGCC7GTT CCCCAAC7AC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
LeuAspIle ValSerLeu PheProAanTyr AspSerArg 7hrTyrPro IleArgThrVal 

- 781 AGCCAGC7GA CCCGCGAGAT TTACACCAAC CCCGTGC7GG AGAAC7TCGA CGGCAGC7TC 
SerGlnLeu ThrArgGlu IleTyrThrAsn ProValLeu GluAsnPhe AspGlySerPhe 

^841 CGCGGCAGCG CCCAGGGCA7 CGAGGGCAGC A7CCGCAGCC CCC:ACCTGA7 GGACA7CCTG 
ArgGlySer AiaGlnGly IleGluGlySer IleArgSer PrcHisLeu MetAspIleLeu 

901 AACAGCATCA CCATC7ACAC CGACGCCCAC CGCGGCGAG7 AC7AC7GGAG CGGCCACCAC 
AsnSerlle ThrlleTyr ThrAspAlaHis ArgGiyGlu TyrTyrTr? SerGlyHisGln 
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961 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC CCCGAGTTCA CCTTCCCCC7 GTACGGCACC 
IleMetAla SerProVal GlyPheSerGly ProGluPhe ThrPhePro LeuTyrGlyThr 

1021 ATGGGCAACG CTGCACC7CA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrArg 

1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCC7CAGAA CAACAACGTG 
TyrArgLys SerGlyTUr VaJLAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGT7CCG CAGTGGCTTC 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAAC\GCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 

SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAla 

1381 GAG7TCAACA ACA7CA7CCC CAGCAGCCAG A7CACCCAGA 7CCCCC7GAC CAAGAGCACC 
GluPheAsn Asnllelle ProSerSerGln Ile71irGln IleProLeu 7hrLysSerTlir 

1441 AACC7GGGC\ GCGGCACCAG CG7GG7GAAG GGCCCCGGC7 7CACCGGCGG CGACATCC7G 
AsnLeuGiy SerGly7hr SerValValLys GlyProGly ?he7hrGly GlyAspIleLeu 

1501 CGCCGCACCA GCCCCGGCCA GA7CAGCACC C7GCGCG7GA ACA7CACCGC CCCCC7GAGC 
ArgArg7Iir SerProGly GlnIleSer7hr LeiiArgVal Asnlle7hr AlaProLeuSer 

1561 CAGCGC7ACC GCG7CCGCA7 CCGC7ACGCC AGCACCACCA ACC7GCAG77 CCACACCAGC 
GlxiArgTyr ArgValArg IleArgTyrAla Ser7hrThr AsxiLeuGln PheHisThrSer 

1621 A7CGACGGCC GCCCCA7CAA CCAGGGCAAC 77CAGCGCCA CCA7GAGCAG CGGCAGCAAC 
IleAspGly ArgProIle AsnGlnGlyAsn PheSerAla ThrMetSei: SerGlySerAsn 

1681 C7GCAGAGCG GCAGC7TCCG CACCG7GGGC 77CACC:ACCC CC77C:AAC77 CAGCAACGGC 
LeuGlnSer GlySerPtie Arg7hrValGly Phe7iir7hr PrqPheAsn PheSerAsnGly 

; 1741 AGCAGCG7G7 7CACCC7GAG CGCCCACG7G T7C\ACAGCG GCAACGAGG7 G7ACATCGAC 
SerSerVal Phe7hrLeu SerAlaHisVal PheAsnSer GlyAsnGlu Val7yrIleAsp 

1801 CGCA7CGAG7 7CG7GCCCGC CGAGG7GACC 77CGAGGCCG AG7ACGACC7 GGAGAGGGCT 
ArglleGlu PheValPro AlaGluVal7hr PheGluAla Glu7yrAsp LeuGluArgAla 

18 61 C^G^GGCCG 7GAACGAGC7 G77CACCAGC AGCAACCAGA 7CGGCC7GAA GACCGACGTG 
GlnLysAla VaLAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu Lys7hrAspVal 
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1921 ACCGACTACC ACATCGAT/CA AGTATCCAAT TTAGT7GAGT G7TTATCTGATGAATT7TGT 
ThrAspTyr HialleAsp/GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 CTGGATGAAA AAAAAGAATT GTCCGAGAAA GTCAAACATG CGAAGCGACT TAGTGATGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 

2041 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA GACAACTAGA CCGTGGCTGG 

ArgAsxiLeu LeuGlnAsp ?roAsnPheArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGTT 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVal 

2161 ACGCTATTGG G7ACCTTTGA TGAG7GCTAT CCAACG7A77 7A7A7CAAAA AA7AGA7GAG 
ThrLeuLeu Gly71xrPhe AspGluCysTyr Pro7hrTyr LeuTyrGln LysIleAspGlu 

2221 TCGAAA77AA AAGCC7A7AC CCG77ACCAA 77AAGAGGG7 A7A7CGAAGA TAG7CAAGAC 
SerLysLeu LysAlaTyr 7hrArg7yrGln LeuArgGly 7yrIleGlu AspSerGlnAsp 

2281 T7AGAAA7C7 A777AA77CG C7ACAA7GCC AAACACGAAA CAG7AAATG7 GCCAGG7ACG 
LeuGluIle 7yrLeuIle Arg7yrAsnAla LysHisGlu 7hrValAsn ValProGlyThr 

2341 GG77CC77A7 GGCCGC777C AGCCCCAAG7 CCAA7CGGAA AA7G7GCCC::A 7CA77CCC:AT 
GlySerLeu 7rpProLeu SerAlaProSer ProIleGly LysCysAla HisHisSerHis 

24 01 CA777C7CC7 7GGACA77GA 7G77GGA7G7 ACAGAC77AA A7GAGGAC77 AGG7G7A7GG 
HisPheSer LeuAspIle AspValGlyCys 7hrAspLeu AsnGluAjsp LeuGiyVal7rp 

24 61 G7GA7A77CA AGA77AAGAC GCAAGA7GGC CA7GCAAGAC TAGGAAA7C7 AGAA777C7C 
ValliePhe LysIleLys ThrGlxiAspGly HisAlaArg LeuGlyAsn LeuGluPheLeu 

2521 GAAGAGAAAC C:A77AG7AGG AGAAGCAC7A GC7CG7G7GA AAAGAGCGGA GAAAAAA7GG 
GluGluLys ProLeuVal GlyGluAlal/eu AlaAxgVal LysArgAla GliiLysLys7rp 

2581 AGAGACAAAC G7GAAAAA77 GGAA7GGGAA ACAAA7A77G 777A7AAAGA GGCAAAAGAA 
ArgAspLys ArgGluLys LeuGlu7rpGlu 7hrAsnIle Val7yrLys GliiAlaLysGlu 

2641 7C7G7AGA7G C777A777G7 AAAC7C7CAA 7A7GA7AGA7 7ACAAGCGGA 7ACCAACA7C 
SerValAsp AlaLeuPhe ValAsnSerGln 7yrAspArg LeuGlnAla Asp7hrAsnIle 

2701 GCGA7GA77C A7GCGGCAGA TAAACGCG77 CATAGCA7TC GAGAAGC77A 7C7GCC7GAG 
AlaMetlle HisAlaAla AspLysArgVal HisSerlle AxgGluAia TyrLeuProGlu 

2761 C7G7C7G7GA 77CCGGG7G7 CAA7GCGGC7 A777T7GAAG AA77AGAAGG GCG7A7777C 
LeuSerVal IleProGly ValAsnAlaAla IlePheGlu GluLeuGlu GlyArgllePhe 

2821 AC7GCA77C7 CCC7A7A7GA 7GCGAGAAA7 G7CA77AAAA A7GG7GA777 TAA7AA7GGC 
7hrAIaPhe SerLeu7yr AspAlaArgAsn VallleLys AsnGlyAsp PheAsnAjsnGly 
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2881 TTATCCTGCT GGAACGTGAA AGGGCATGTA GATGTAGAAG AACAAAACAA CCACCGTTCG 
LeuSerCys TrpAsnVal LysGiyHisVal AspValGlu GluGlnAsn AsnflisArgSer 

2941 G7CCTTGTTG TTCCGGAATG GGAAGCAGAA GTGTCACAAG AAGTTCGTGT CTGTCCGGGT 
ValLeuVal ValProGlu TrpGluAlaGlu ValSerGln GluVaLArg ValCysProGly 

3001 CGTGGCTATA TCCTTCGTGT CACAGCGTAC AAGGAGGGAT ATGGAGAAGG TTGCG7AACC 
AxgGlyTyr IleLeuArg ValThrAlaTyr LysGluGly TyrGlyGlu GlyCysValThr 

30 ei ATTCATGAGA TCGAGAACAA 7ACAGACGAA CTGAAGTTTA GCAACTGTGT AGAAGAGGAA 
IleHisGlu IleGluAsn AsnThrAspGlu LeuLysPhe SerAsnCys ValGluGluGlu 

3121 G7ATATCCAA ACAACACGG7 AACG7G7AA7 GA77A7AC7G CGAC7CAAGA AGAA7A7GAG 
Val7yrPro AsnAsn7hr Val7hxCysAsn Asp7yrThr Ala7hrGln GluGlu7yrGlu 

3181 GG7ACG7ACA C77C7CG7AA 7CGAGGA7A7 GACGGAGCC7 A7GAAAGCAA 77C77C7G7A 
Gly7hr7yr 7hrSerArg AsnArgGly7yr AspGlyAla TyrGluSer AsnSerSerVal 

3241 CCAGC7GA77 A7GCA7CAGC C7A7GAAGAA AAAGCA7A7A CAGA7GGACG AAGAGACAAT 
ProAlaAsp 7yrAlaSer Ala7yrGluGlu LysAla7yr 71irAspGly ArgArgAspAsn 

3301 CC77G7GAA7 C7AACAGAGG A7A7GGGGA7 7ACACACCAC 7ACCAGC7GG C7A7G7GACi 
ProCysGlu SerAsnArg Gly7yrGlyAsp 7yr7hrPro LeuProAla GlyTyrValThr 

3361 AAAGAA77AG AG7AC77CCC AGAAACCGA7 AAGG7A7GGA 77GAGA7CGG AGAAACGGAA 
LysGluLeu Glu7yrPhe ProGlu7JxrAsp LysVal7rp IleGluIle GlyGlu7hrGlu 

34 21 GGAACA77CA 7CG7GGACAG CG7GGAA77A C77C77A7GG AGGAA7AA 
Gly7hrPhe IleValAsp SerValGluLeu LeuI-euMet: GluGlu 
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FIG. 9 



1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA- ACTGCCTGAG CAACCCCGA3 
MecAspAsn AsnProAsn IleAsnGluCys lieProTyr AsnCysLeu SerAsnProGlu 

6i GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CA7CAGCCTG 
VaiGiuVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGlyPhe ValLeuGiyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
VaiAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuVaiGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCA7 CAGCCGCCTG 
GluGinLeu IleAsnGln ArglleGluGlu PheAlaArg AsnGlnAla IleSerArgLeu 

3C1 GAGGGCCTGA GCAACCTGTA CCAAATC7AC GCCGAGAGCT TCCGCGAG7G GGAGGCCGAC 
GiuGiyLeu SerAsnLeu 7yrGlnIleTyr AlaGluSer PheArgGlu 7rpGluAlaAs? 

561 CCCACCAACC CCGCCC7GCG CGAGGAGA7G CGCA7CCAG7 TCAACGACA7 GAACAGCGCC 
?ro7hrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp MetAsnSerAla 

421 C7GACCACCG CCA7CCCCC7 G77CGCCG7G CAGAAC7ACC AGG7GCCCC7 GC7GAGCG7G 
Leu7hrThr AlallePro LeuPheAlaVal GlnAsn7yr GlnValPro LeuLeuSerVal 

481 7ACG7GCAGG CCGCCAACC7 GCACC7GAGC G7GC7GCGCG ACG7CAGCG7 G77CGGCCAG 
TyrValGlr. AlaAlaAsn LeuHisLeuSer ValLeuArg AspVaiSer ValP^^eGlyGlr. 

54 1 CGC7GGGGC7 7CGACGCCGC CACCA7C-\AC AGCCGCTACA ACGACC7GAC CCGCC7GA7: 
Arg7rpGiy PheAspAla Aia7hrIleAsn SerArg7yr AsnAspLeu ThrArgLeuIle 

5C1 GGCAAC7ACA CCGACCACGC CG7GCGC7GG 7ACAACACCG GCC7GGAGCG CG7G7GGGGT 
GiyAsn7yr 7hrAspHis AlaVaiArg7rp 7yrAsn7hr GiyLauGlu ArgVal7rpGly 

6 61 CCCGACAGCC GCGAC7GGA7 CAGG7ACAAC CAG77CCGCC GCGAGC7GAC CC7GACCG7G 
ProAspSer ArgAsp7rp IleArg7yrAsn GlnPheArg ArgGluLeu 7hrLeu7hrVa: 

111 C7GGACA7CG 7GAGCC7G77 CCCCAAC7AC GACAGCCGCA CC7ACCCCA7 CCGCACCG7G 
LeuAspIle VaiSerLeu PheProAsn7yr AspSerArg 7hr7yrPro IleArg7hrVa: 

781 AGCGAGC7GA CCCGCGAGA7 77ACACCAAC CCCG7GC7GG AGAAC77CaA CGGCAGC7TC 
SerGlnLeu 7hrArgGlu Ile7yr7nrAsn ProValLeu GluAsnPhe AspGiySerPhe 

841 CGCGGCAGCG CCCAGGGCA7 CGAGGGCAGC A7CCGCAGCC CCCACC7GAT GGACA7CC7G 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHisLeu MenAspIleLeu 

901 AACAGCA7CA CCATC7ACAC CGACGCCCAC CGCGGCGAG7 AC7AC7GGAG CGGCCACCAG 
AsnSerlle 7hrlle7yr 7hrAspAlaHis ArgGlyGlu 7yr7yr7rp SerGlyHisGln 

961 ATCA7GGCGA GCCCCG7CGG C77CAGCGGC CCCGAG77CA CC77CCCCC7 G7ACGGCACC 
IleMetAia SerProVai GlyPheSerGly ProGluPhe 7hr?he?ro Leu7yrGlyThr 
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rIG. ? (CONT) 

ICZ: ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGG TGGGCCAGGG AGTG7ACCGC 
MetGlyAsn AiaAlaPro GinGiaArglie VaiAiaGln LeuGlyGin GlyVaiTyrArg 

1C51 ACCCTGAGCA GCACCC7GTA CCGTCGACCT TTCAACATCG GCA7CAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylieAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGC3CCGTG 
SerVaiLeu AspGlyThr GluPheAlaTyr GiyThrSer SerAsnLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCC7G GACGAGA7CC CCCC7CAGAA CAACAACG7G 
7yrArgLys 5erGly7hr VaiAspSerLeu AspGluIle ProProGln AsaAsnAsnVal 

12 61 CCACC7CGAC AGGGC77CAG CCACCG7C7G AGCCACG7GA GCA7G77CCG CAG7GGC77C 
ProProArg GinGiyPhe^ SerHisArgLeu SerHisVal SerMecPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCG7GAGCA7 CA7CCG7GCA CC7A7G7TCA GC7GGA77CA CCGCAG7GCC 
SerAsnSer SerValSer IlelieArgAia ProMetPhe Ser7rplle HisArgSerAla 

1351 GAG77CAACA ACATCA7CCC CAGCAGCCAG A7CACCCAGA 7CCCCC7GAC CAAGAGCACG 
GluPheAsn Asnllelle ProSerSerGIn Ile7hrGin IleProLeu 7hrLysSerThr 

14 41 AACC7GGGCA GCGGCACCAG CG7GG7GAAG GGCCCCGGCT 7CACCGGCGG CGACATCC7G 

AsnLeuGly SerGly7hr SerValValLys GlyProGly PheThrGly GlyAspIleLeu 

15C1 CGCCGCACCA GCCCCGGCZA GA7CAGCACC C7GCGCG7GA ACA7CACCGC CZZCZZGAGC 
ArgArgT^.^ SerPrcGly GinIleSer7hr LeuArgVal Asnlle7hr AlaPrcLauSer 

15 61 CAGCGC7ACC GCG7CCGCA7 CCGC7ACGCC AGCACCACCA ACC7GCAG77 CCACACCAGC 

GlnArg7yr ArgVaiArg IieArg7yrAla Ser7hrThr AsnLeuGln ?heHis7hrSer 

1521 A7CGACGGCC GCCCCA7CAA CCAGGGCAAC 77CAGCGCCA CCA7GAGCAG CGGCAGCAAC 
IleAspGly ArgProIle AsnGlnGlyAsn PheSerAla 7hrMetSer SerGIySerAsn 

1631 C7GCAGAGCG. GCAGC77CCG CACCG7GGGC T7CACCACCC CG77CAAC77 CAGCAACGGC 
LeuGlnSer GlySerPhe Arg7hrValGly Phe7hr7hr ProPheAsn PheSerAsnGly 

1141 AGCAGCG7G7 7CACCC7GAG CGCCCACG7G 7TCAACAGCG GCAACGAGG7 G7ACATCGAC 
SerSerVal ?he7r:rLeu SerAlaHisVal PheAsnSer GlyAsnGlu Val7yrIleAsp 

_ 18C1 CGCA7CGAG7 7CG7GCCGGC CGAGG7GACC 77CGAGGCCG AG7ACGACC7 GGAGAGGGC7 
ArglleGiu PheValPro AlaGluVai7hr PheGiuAla Glu7yrAsp LeuGl^oArgAla 

1861 CAGAAGGCCG 7GAACGAGC7 G77CACCAGC AGCAACCAGA 7CGGCC7GAA GACCGACG7G 
GlnLysAia ValAsnGiu LeuPhe7hrSer SerAsnGln lieGlyLeu Lys7hrAspV'al 

1921 ACCGAC7ACC ACA7CGA7CA AG7A7CCAA7 77AG77GAG7 G777A7C7GA 7GAA7T77G7 
7hrAsp7yr HisIleAsp GinValSarAsn LeuVaiGlu CysLeuSer AspGluPheCys 

1981 C7GGA7GAAA AAAAAGAA77 G7CCGAGAAA G7CAAACA7G CGAAGCGAC7 7AG7GA7GAG 
Le'jAspGlu LysLysGlu LeuSerGluLys ValLysHis AiaLysArg LeuSerAspGlu 
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FIG. 9 (CONT) 

'f 
< 

2G41 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA -GACAACTAGA CCG7GGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArgGinLeu AspArgGlyTrp 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAAT7ACG7T 
ArgGiySer ThrAspIle ThrlieGlnGiy GlyAspAsp VaiPheLys GluAsnTyrVal 

2161 ACGCTATTGG GTACCTTCGA CGAG7GCTAC CCCACC7ACC 7G7ACCAGAA GA7CGACGAG 
7hrLeuLeu Giy7hr?he AspGluCys7yr ?ro7hr7yr Leu7yrGln LysIieAspGlu 

2221 AGCAAGC7GA AGGCC7ACAC CCGC7ACCAG C7GCGCGGC7 ACA7CGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr 7hrArgTyrGln LeuArgGly TyrlleGlu AspSerGinAs? 

22 31 C7GGAAA7C7 ACC7GA7CCG C7ACAACGCC AAGCACGAGA CCG7GAACG7 GCCCGGCACC 
LeuGiuIle 7yrLeuIle Arg7yrAsnAla LysHisGlu 7hrValAsn VaiProGly7hr 

22 41 GGCAGCG7G7 GGCCCC7GAG CGCCCCCAGC CCCA7CGGCA AG7GCGGGGA GCCGAA7CGA 
GlySerLeu 7rpProLeu SerAiaProSer ProIleGly LysCysGly GluProAsnArg 

2411 TGCGC7CCGC ACC7GGAG7G GAACCCGGAC C7AGAC7GCA GC7GCAGGGA CGGGGAGAAG 
CysAiaPro HisLeuGiu 7rpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

24 61 7GCGCCCACC ACAGCCACCA C77CAGCC7G GACA7CGACG 7GGGCTGCAC CGACC7GAAC 

CysAiaHis HisSerHis HisPheSerLeu AspIleAsp ValGiyCys 7hrAspLeuAsn 

2521 GAGGACC7GG GCG7G7GGG7 GA7C77CAAG A7CAAGACCC AGGACGGCCA CGCCCGCC7G 
GluAspLeu GlyVai7rp VaillerheLys IleLys7yir GinAspGly HisAlaArgLe. 

25 31 GGCAA7C7AG AA777C7CGA AGAGAAACCA 77AG7AGGAG AAGCAC7AGC 7CG7G7GAAA 

GlyAsnLeu GiuPheLeu GluGluLysPro LeuVaiGly GluAlaLeu AiaArgVaiLys 

2 641 AGAGCGGAGA AAAAA7GGAG AGACAAACG7 GAAAAA77GG AA7GGGAAAC AAA7A77G77 
ArgAlaGiu LysLys7rp ArgAspLysArg GluLysLeu Giu7rpGlu 7hrAsnIleVal 

2701 7A7AAAGAGG CAAAAGAA7C 7G7AGA7GC7 77A777G7AA AC7C7CAA7A 7GA7AGA77A 
7yrLysGlu AlaLysGlu SerValAspAla LeuPheVal AsnSerGin 7yrAspArgLeu 

27 61 CAAGCGGA7A CCAACA7CGC GA7GA77CA7 GCGGCAGA7A AACGCG77CA 7AGCA77CGA 
GlnAiaAsp 7hrAsnIle AlaMetlleHis AlaAlaAsp LysArgVal HisSerlleArg 

,2821 GAAGC77A7C 7GCC7GAGC7 G7C7G7GA77 CCGGG7G7CA A7GCGGC7A7 7777GAAGAA 
Gi^aAla7yr LeuProGlu LeuSerVallle ProGlyVal AsnAlaAia IlePheGluGlu 

2 881 77AGAAGGGC G7A7T77CAC 7GCA77C7CC C7A7A7GA7G CGAGAAA7G7 CA77AAAAA7 
LeuGluGly ArgllePhe 7hrAlaPheSer Leu7yrAsp AiaArgAsn VailleLysAsn 

2941 GG7GA7777A A7AA7GGC77 A7CC7GC7GG AACG7GAAAG GGCA7G7AGA 7G7AGAAGAA 
^ GlyAspPhe AsnAsnGly LeuSerCys7rp AsnVaiLys GlyHisVal AspValGluGlu 

2001 CAAAACAACC ACCG77CGG7 CC77G77G77 CCGGAA7GGG AAGCAGAAG7 G7CACAAGAA 
GinAsnAsn HisArgSer ValLeuVaiVal ?roGiu7rp GiuAlaGiu ValSerGinGlu 
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FIG. 9 (CONT) 



30 61 GTTCGTGTCT GTCCGGGTCG TGGCTATATC CTTCGTGTCA CAGCGTACAA GGAGGGATAT 
VaiArgVal CysProGiy ArgGlyTyrlle LeuArgVal ThrAiaTyr LysGluGlyTyr 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAG7TTAGC 
GlyGluGly CysValThr IleHisGluIls GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTAATGA TTATACTGCC- 
AsnCysVai GluGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTyrThrAIa 

3241 ACTCAAGAAG AATATGAGGG TACGTACAC? TCTCGTAATC GAGGATATGA CGGAGCCTAT 
ThrGlnGIu GluTyrGlu GlyThrTyrThr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3301 GAAAGCAATT CTTCTG7ACC AGCTGA7TAT GCATCAGCCT ATGAAGAAAA AGCATATAC\ 
GluSerAsn SerSerVal PrcAlaAspTyr AiaSerAla TyrGluGiu LysAlaTyrThr 

33 61 GA7GGACGAA GAGACAATCC 77G7GAA7C7 AACAGAGGA7 A7GGGGA77A CACACCAC7A 

AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly 7yrGlyAsp 7yrThrProLeu 

3 4 21 CCAGC7GGC7 •A7G7GACAAA AGAA77AGAG TAC77CCCAG AAACCGA7AA GG7A7GGA7T 
ProAlaGly 7yrVal7hr LysGluLeuGlu 7yrPhePro GluThrAsp LysVal7rpIle 

34 81 GAGA7CGGAG AAACGGAAGG AACA77CA7C G7GGACAGCG 7GGAA77AC7 7C77A7GGAG 

GluIleGly GluThrGlu Gly7hrPheIle VaiAspSer ValGluLeu LeuLeuMetGlu 

3541 GAA7AA 
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rlG. 1 



i ATGGACAACA ACCCCAACAT CAACGAGTGC 
MecAspAsn AsnProAsn IleAsnGiuCys 

1 GTGGAGGTGC TGGG CATCGAG 
ValGluVai LeuGlyGly GluArglleGlu 

1 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC 
SerLeuThr GlnPheLeu LeuSerGiuPhe 

1 GTGGACATCA 7CTGGGGCAT CTTCGGCCCC 
ValAspIle IleTrpGly IlePheGlyPro 

1 GAGCAGCTGA TCAACCAGCG ' CA7CGAGGAG 
GluGinLeu IleAsnGln ArglleGiuGlu 

1 GAGGGCC7GA GCAACCTGTA CCAAATCTAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr 

i lCwACwAACC CCG^— cctgcg cgaggagatg 
PrOihrAsn ProAiaLeu ArgGluGluMet 

1 ci:gkczkczz ccatccccct gttcgccgtg 

LeuThrThr AlallePro LeuPheAIaVal 

1 TACGTGCAGG CCGCCAACCT GCACCTGAGC 
TyrVaiGln AiaAlaAsn LeuHisLeuSer 

ArgTrpGly PheAspAla AiaThrlleAsn 

1 GGCAACTACA CCGACCACGC CG7GCGC7GG 
GlyAsn7yr 7hrAspHis AiaVaiArg7rp 

1 CCCGACAGCC GCGAC7GGA7 CAGG7ACAAC 

ProAspSer Ar9Asp7rp IieArg7yrAsn 

1 C7GGACA7CG 7GAGCG7G77 CCCCAAC7AC 
LeuAspIie VaiSerLeu PheProAsn7yr 

1 AGCCAGCTGA CCCGCGAGA7 77ACACCAAC 
SerGinLeu 7hrArgGlu Ile7yrThrAsn 

1 CGCGGCAGCG CCCAGGGCA7 CGAGGGCAGC 
ArgGlySer AlaGlnGly IleGluGlySer 

1 AACAGCA7CA CGA7C7ACAC CGACGCCCAC 
AsnSerlie 7hrlle7yr 7hrAspAlaHis 

1 A7CA7GGCGA GCCCCG7CGG C77CAGCGGC 
IleMetAIa SerProVai GlyPheSerGly 



A7CCCC7ACA AC7GCC7GAG CAACCCCGA; 
IlePro7yr AsnCysLeu SerAsnProGl 

ThrGiy7yr 7hrProIle AspIleSerLe: 

G^f^f^^^^t^^^ f^f^f*^f^rr%tft^r^^ f^^mm^^^f^f^^m* 

VaiProGly AlaGlyPhe VaiLeuGlyLe*. 

AGCCAG7GGG ACGCC77CC7 GG7GCAGA7: 
SerGln7rp AspAlaPhe LeuValGlnlle 

77CGCCCGCA ACCAGGCCA7 CAGCZGCCTZ 
PheAiaArg AsnGlnAia IleSerArgLeu 

AlaGiuSer PheArgGlu 7rpGluAiaA£z 
CGCA7CCAG7 7CAACGACA7 GAACAGCGCZ 

ArglleGln PheAsnAsp MetAsnSerAla 

CAGAAC7ACC AGG7GCCCC7 GC7GAGCG73 
GlnAsn7yr GlnValPro LeuLeuSerVal 

r'm^f^mr^f^r^t^r* '\r*r^*^r"\f^^'^^ /-•m#T»/^'*i^r^/** • 

ValLeuArg AspVaiSer ValPheGlyGl:. 

SerArg7yr AsnAspLeu 7hrArgLeuIIe 

ThCAACACCG GCC7GGAGCG CG7G7GGGGT 
7yrAsn7hr GlyLeuGlu ArgVai7rpGly 

CAG77CCGCC GCGAGC7GAC CC7GACCG7G 
GlnPheArg ArgGluLeu 7hrLeu7hrVal 

GACAGCCGCA CC7ACCCCA7 CCGZACCGTZ 
AspSerArg 7hr7yrPro IleArg7hrVal 

CCCG7GC7GG AGAAC77CGA CGGCAGC7TC 
ProVaileu GluAsnPhe AspGlySerPhe 

A7CCGCAGCC CCCACG7GA7 GGACATCC7G 
IleArgSer PrcHisLeu MetAspIleLeu 

GAG7 AC7AC7GGAG CGGCCACCAG 
ArgGlyGlu TyrTyrTrp SerGlyHisGln 

CCCGAG77CA CC77CCCCC7 G7ACGGCACC 
ProGluPhe 7hr?hePro Leu7yrGiy7hr 



^1/94 



wo 93/07278 A Cr/L'S92/08476 




FIG. 11 (CDNT) 

' 1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGChCAGC TGGGCC?^GGG AGTGTACCGC 
Met:GlyAsn AlaAiaPro GInGlnArglle ValAiaGln LeuGlyGln GiyVaiTyrArg 

1081 ACCCTGAGCA GCACCCTGTA CCG7CGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGinGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVai 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA C2^CAACGTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

12 51 CCACGTCGAC AGGGCTTCAG CCACCG7CTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerVaiSer lielieArgAia ProMetPhe SerTrpIle HisArgSerAla 

1351 GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
GluPheAsn Asnllelie ProSerSerGin IleThrGin IleProLeu ThrLysSerThr 

AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGly GlyAspIleLeu 

1301 CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCGTGAGC 
ArgArgThr SerProGiy GinlieSerThr LeuArgVai AsnlleThr AlaPrcLeuSer 

1561 CAGCGCTACC GCGTCCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAG: 
GlnArgTyr ArgValArg IleArgTyrAia SerThrThr AsnLeuGin PheHisThrSer 

1621 ATCGACGGCC GCGCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
lieAspGiy ArgProIle - AsnGlnGlyAsn PheSerAla ThrMetSer SerGlySerAsn 

1681 CTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACGGC 
LeuGlnSer GlySerPhe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

1741 AGCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGG7 GTACA7CGAC 
SerSerVal PheThrLeu SerAlaHisVai PheAsnSer GlyAsnGlu ValTyrlleAsp 

1801 CGCA7CGAG7 7CG7GCCCGC CGAGG7GACC' 77CGAGGCCG AG7ACGACC7 GGAGAGGGC7 
- ArglleGlu PheValPro AlaGluVai7hr PheGluAla GluTyrAsp LeuGluArgAla 

1861 C^iGAAGGCCG TGAACGAGC7 G77CACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG 
GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

1921 ACCGAC7ACC ACA7CGATCA GGTGAGCAAC C7GG7GGAG7 GC7TAAGCGA CGAG7TC7GC 
ThrAspTyr HisIleAsp GlnValSerAsn LeuVaiGlu CysLeuSer AspGluPJieCys 

1981 C7GGACGAGA AGAAGGAGC7 GAGCGAGAAG G7GAAGCACG CCAAGCGCC7 GAGCGACGAG 
LeuAspGiu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 
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FIG. i: (CCNT) 

' 2041 CGCAACCTGC TGCAGGACCC CAACTTCCGC GGCATCAACC GCGAGCTGGA CCGCGGCTGG 
ArgAsnLeu LeuGinAsp ProAsnPheArg GlylleAsn ArgGinLeu AspArgGiyTrp 

' 2101 CGAGGCAGCA CCGATATCAC CATCCAGGGC GGCGACGACG TGTTCAAGGA GAAC7ACGTG 
ArgGlySer ThrAspIle ThrlleGlnGly GiyAspAsp ValPheLys Gl'jAsnTyrVai 

2161 ACCGTGCTGG GCACCTTCGA CGAGTGCTAC CCCACCTACC TG7ACCAGAA GATCGACGAG 
ThrLeuLeu GlyThrPhe AspGluCysTyr PrcThrTyr LeuTyrGln LysIleAspGlu 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAsp 

2281 CTGGAAATCT ACCTGA7CCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCGCGGCACC 
LeuGiuIle TyrLeuIle ArgTyrAsnAla LysHisGlu ThrVaiAsn VaiPrcGlyThr 

2241 GGCAGCCTGT GGCCCCTGAG CGCCCCCAGC CGCATCGGCA AGTGCGGGGA GCGGAATCGA 
GlySerLeu TrpProLeu SerAlaProSer ProIieGiy LysCysGly GiuProAsnArg. 

24 CI TGCGCTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 
CysAlaPro HisLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGiyGluLys 

24 61 TGCGCCCACC ACAGCCACCA C7TCAGCCTG GACA7CGACG 7GGGC7GCAC CGACC7GAAC 
CysAlaHis HisSerHis HisPheSerLeu AspIleAsp ValGlyCys 7^.rAspLeuAsn 

2521 GAGGACC7GG GCG7G7GGG7 GATC7TCAAG A7CAAGACCC AGGACGGCCA CGCGGGCC7G 
GluAspLeu GlyVclTrp VallLerheLys IleLysThr GlnAspGiy HisAlaArgLeu 

2581 GGCAATC7AG AA777C7CGA AGAGAAACCA T7AG7AGGAG AAGCACTAGC TCGTG7GAAA 
GlyAsnLeu GiuPheLeu GluGiuLysPro LeuVaiGiy GluAiaLeu AlaArgValLys 

2 641 AGAGCGGAGA AAAAA7GGAG AGACAAACG7 GAAAAAi7GG AA7GGGAAAC AAA7A77G77 
ArgAiaGlu LysLys7rp ArgAspLysArg GluLysLeu Glu7rpGlu 7hrAsnIleVai 

2701 7A7AAAGAGG CAAAAGAA7C TG7AGATGC7 T7A777G7AA AC7C7CAA7A TGATAGA77A 
TyrLysGiu AlaLysGlu SerVaiAspAla LeuPheVai AsnSerGln TyrAspArgLeu 

27 61 CAAGCGGA7A CC:AACA7CGC GA7GA77CA7 GCGGCAGA 7A AACGCG77CA TAGCAr7CGA 
GlnAiaAsp 7hrAsnIle AlaMerlleHis AiaAlaAsp LysArgVal HisSerlleArg 

2821 GAAGC77A7C TGCC7GAGC7 G7C7G7GA77 CCGGG7G7CA A7GCGGC7A7 T777GAAGAA 
: GluAia7yr LeuPraGlu LeuSerVallle ProGlyVal AsnAlaAla IlePheGluGlu 

2881 T7AGAAGGGC G7A7T77C1AC 7GCA77C7CC C7A7A7GA7G CGAGAAA7G7 CA7TAAAAAT 
LeuGluGly ArgllePhe 7hrAlaPheSer Leu7yrAsp AiaArgAsn VailleLysAsn 

2941 GG7GA7777A A7AA7GGC77 A7CC7GC7GG AACG7GAAAG GGCA7G7AGA 7G7AGAAGAA 
GlyAspPhe AsnAsnGly LeuSerCys7rp AsnValLys GlyHisVal AspValGiuGlu 

3001 CAAAACAACC ACCG77CGG7 CC77G77G77 CCGGAA7GGG AAGCAGAAG7 G7CACAAGAA 
GlnAsnAsn HisArgSer VaiLeuVaiVal Pr^GluTrp Gl'jAlaGlu ValSerGlnGlu 
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30 61 GTTCGTGTCT GTCCGGGTCG TGGCTATATC CTTCGTGTCA. CAGCGTACAA GGAGGGATA7 
ValArgVal CysProGly ArgGlyTyrlie LeuArgVal ThrAlaTyr LysGluGlyTyr 

3121 GGAGAAGGTT GCG7AACCAT TCATGAGATC GAGAACAA7A CAGACGAACT GAAG7TTAGC 
GlyGluGly CysValThr IleHisGluIle GiuAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTG7AATGA 77A7AC7GCG 
AsnCysVal GluGluGlu Val7yrProAsn Asn7hrVal ThrCysAsn Asp7yrThrAla 

3241 AC7CAAGAAG AA7A7GAGGG 7ACGTACAC7 7CTCG7AA7C GAGGATA7GA CGGAGCC7AT 
7hrGlnGlu Glu7yrGlu Gly7hrTyrThr SerArgAsn ArgGly7yr AspGlyAla7yr 

3301 GAAAGCAA77 C77C7G7ACC AGC7GATTA7 GCA7CAGCC7 A7GAAGAAAA AGCA7ATACA 
GluSerAsn SerSerVai ?roAiaAsp7yr AlaSerAia 7yrGluGlu LysAla7yr7hr 

3361 GA7GGACGAA GAGACAA7CC 77G7GAA7C7 AACAGAGGA7 A7GGGGAT7A CACACCAC7A 
■ AspGlyArg ArgAspAsn ProCysGiuSer AsnArgGly TyrGlyAsp 7yrThr?roLeu 

3 421 ■ CCAGCTGGC7 ATGTGACAAA AGAA77AGAG TACTTCCCAG .W^CCGA7AA GG7A7GGATT 
ProAlaGiy 7yrVai7hr LysGluLeuGiu TyrPhePro Glu7hrAsp LysVai7rpIIe 

34 81 GAGA7CGGAG AAACGGAAGG AACA77CA7C G7GGACAGCG TGGAATTAC7 7C7TATGGAG 
GluIleGly Glu7hrGlu Gly7hrPheIie VaiAspSer ValGluLeu LeuLeuMetGlu 

3541 GAA7AA 
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FIG, 13 



1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCGCGTACA ACTGCCTGAG CAACCCCGAG 
MecAspAsn AsnProAsn IleAsnGIuCys IleProTyr AsnCysLeu SerAsnProGlu 

51 GTGGAGG7GC TGGGCGGCGA GCGCA7CGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
VaiGluVal LeuGlyGiy GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCT7CGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe VaiProGly AiaGlyPhe VaiLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCGGCCTG 
GluGlnLeu IleAsnGln ArglleGluGlu PheAlaArc AsnGlnAia IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCGAG7G GGAGGCCGAC 
GluGlyLeu SerAsnLeu 7yrGlnIle7yr AlaGluSer PheArgGlu TrpGIuAlaAsp 

3 51 CCCACCAACC CCGCCC7GCG CGAGGAGA7G CGCA7CCAG7 7CAACGACA7 GAACAGCGCC 
ProThrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp MecAsnSerAla 

421 C7GACCACCG CC:A7CCCCC7 G77CGCCG7G CAGAAC7ACC AGG7GCCCC7 GC7GAGCG7G 
Leu7hr7hr AlallePro LeuPheAlaVai GlnAsn7yr GlnValPro LeuLeuSerVal 

481 rACG7GCAGG CCGCCAACC7 GCACC7GAGC G7GC7GCGCG ACG7CAGCG7 G77CGGCCAG 
TyrValGln AlaAlaAsn LeuHisLeuSer ValLeuArg AspVaiSer VaiPheGiyGin 

54 1 CGC7GGGGC7 TCGACGCCGC CACGA7CAAC AGCCGC7ACA ACGACC7GAC CCGCC7GA7C 
Arg7rpGiy PheAspAla Ala7hrIleAsn SerArg7yr ' AsnAspLeu ThrArgLeuIle 

601 GGCAAC7ACA CCGACCACGC CG7GCGC7GG TACAACACGG GCC7GGAGCG CG7G7GGGG7 
GiyAsn7yr 7hrAspHis AlaValArgTr? 7yrAsn7hr GlyLeuGiu ArgVai7rpGly 

661 CCCGACkGCZ GCGAC7GGA7 CAGG7ACAAC CAG77CCGCC GCGAGC7GAC CC7GACCG7G 
ProAspSer ArgAspTrp IleArg7yrAsn GlnPheArg ArgGluLeu 7hrLeu7hrVal 

'21 C7GGACA7CG 7GAGCC7G77 CCGCAAC7AC GACAGCCGCA CC7ACCCCA7 CCGCACCG7G 
LeuAspIle VaiSerLeu PheProAsn7yr AspSerArg 7hr7yrPro IleArg7hrVal 

781 AGCGAGC7GA CCCGCGAGA7 77ACACCAAC CCCG7GC7GG AGAAC77CGA CGGCAGC77C 
SerGinLeu 7hrArgGlu Iie7yr7hrAsn ProValLeu GluAsnPhe AspGlySerPhe 

341 CGCGGCAGCG CCCAGGGCA7 CGAGGGCAGC A7CCGCAGCC CCCACC7GA7 GGACA7CC7G 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHisLeu MecAspIleLeu 

901 AACAGCA7CA CCA7C7ACAC CGACGCCCAC CGCGGCGAG7 AC7AC7GGAG CGGCCACCAG 
AsnSerlle 7hrIleTyr 7hrAspAlaHis ArgGlyGlu 7yr7yr7rp SerGlyHisGln 

961 ATCATGGCCA GCCZCG7CGG C77CAGCGGC CCCGAG77CA CC77CGCCC7 GTACGGCACC 

Ile.MetAia SerProVai GlyPheSerGly ProGluPhe 7hrPhePro Leu7yrGly7hr 
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1C21 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AGTGTACCGC 
Met.GlyAsn AiaAiaPro GinGinArglie valAlaGin LeuGlyGin GiyVaiTyrArg 

' 1081 ACCC-:G?iGC2i GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACC7GCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu PrsSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACG7G 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

12 61 CCACCTCGAC AGGGCTTCAG CGACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 

ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMecPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CGTATGTTCA GCTGGATTCA CGGCAGTGCC 
'SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIle HisArgSerAia 

1381 GAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 

GluPheAsn Asnllelie ProSerSerGln IleThrGIn IleProLeu ThrLysSerThr 

1441 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGC7 TCACCGGCGG CGACATCCTG 
AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGly GlyAspIleLeu 

1501 CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCGTGAGC 
ArciAraThr SerrroGly GlnlleSerThr Le-^^^rgVal AsnlleThr AlaPrcLeuSer 

:5£1 CAGCGCTACG GCGTCCGCAT CCGCTACGCC AGCACGACCA ACCTGCAGTT CCACACCAGC 
GlnArgTyr ArgValArg IleArgTyrAla SerThrThr AsnLeuGln PheHisThrSer 

1521 ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
IleAspGIy ArgProIle AsnGinGlyAsn PheSerAia ThrMetSer SerGlySerAsn 

1581 CTGCAGAGCG GCAGC7TCCG CACCGTGGGC TTCACGACCC CCTTCAACTT CAGCAACGGC 
LeuGlnSer GlySerPhe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

1741 AGCAGCGTGT TCACCCTGAG CZCZZACZTG TTCAACAGCG GCAACGAGGT GTACATCGAC 
SerSerVai PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu VaiTyrlleAsp 

1301 CGCATCGAGT TCG7GCCZGC CGAGGTGACG TTCGAGGCCG AGTACGACCT GGAGAGGGC? 
^ ArglleGlu PheValPro AlaGluValThr PneGluAia GluTyrAsp LeuGl'jArgAla 

13 61 CAGAAGGCCG TGAACGAGCT GT7CACCAGC AGCAACCAGA 7CGGCC7GAA GACCGACGTG 

GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu Lys7hrAspVal 

1921 ACCGAC7ACC ACATCGA7CA GG7GAGCAAC C7GG7GGAG7 GC77AAGCGA CGAG77CTGC 
ThrAspTyr HisIleAsp GlnValSerAsn LeuValGlu CysLeuSer AspGIuPheCys 

1981 C7GGACGAGA AGAAGGAGC7 GAGCGAGAAG GTGAAGCACG CCAAGCGCC7 GAGCGACGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGiu 

47/94 



wo 93/07278 



CT/ US92/08476 



FIG. 13 (CONT) 

.c 

2041 CGCAACCTGC TGCAGGACCC CAACTTCCGC GGCATCAACC GCZAGCTGG?^ CCGCGGCTGG 
ArgAsnLeu LeuGinAsp ProAsnPheArg GlylieAsn ArgGlnLeu AspArgGiyTrp 

^ 2101 CGAGGCAGCA CCGATATGAC CATCCAGGGC GGCGACGACG TGTTCAAGGA GAACTACGTG 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVal 

2161 ACCCTGCTGG GCACCTTCGA CGAGTGCTAC CCCACC7ACC TGTACCAGAA GA7CGACGA3 
ThrLeuLeu GlyThrPhe AspGiuCysTyr ProThrTyr LeuTyrGln LysIleAspGIu 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGly TyrlleGlu AspSerGlnAs? 

22 81 CTGGAAATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCG7GAACG7 GCCCGGCJiCZ 
LeuGluIle 7yrLeuIle Arg7yrAsnAla LysHisGlu 7hrValAsn VaI?roGly7hr 

2 341 GGCAGCCTG7 GGCCCCTGAG CGCCCCCAGC CCCA7CGGCA AG7GCGGGGA GCCGAA7CGA 
GlySerLeu TrpProLeu SerAlaProSer ProIleGiy LysCysGly GluProAsnArg 

2 401 ZGCGC7CZGC ACC7GGAG7G GAACCCGGAC C7AGAC7GCA GC7GCAGGGA CGGGGAGAAG 
CysAiaPro HisX-euGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

2 4 61 7GCGCZZACC ACAGCCACCA C77CAGCC7G GACATCQACG 7GGGC7GCAC CGACCTGAAC 
CysAlaHis HisSertiis HisPheSerLeu AspIleAsp ValGlyCys 7hrAspLeuAsn 

2 521 GAGGACCTGG GCG7G7GGG7 GATC7TCAAG ATCAAGACCC AGGACGGCCA CGCCCGCCTG 
GluAspLeu GlyVal7r? ValllePheLys IleLys7rir GlrJispGiy HisAlaArgLeu 

GiyAsnLeu GluPheLeu GluGluLysPro LeuVaiGly GluAlaLeu AlaArgValLys 

2 641 CGCGCCGAGA AGAAG7GGCG CGACAAGCGC GAGAAGC7GG •AG7GGGAGAC CAACA7CG7G 
ArgAlaGlu LysLysTrp ArgAspLysArg GluLysLeu GluTrpGlu 7hrAsnIleVal 

27 01 TACAAGGAGG CCAAGGAGAG CG7GGACGCC C7G77CG7GA ACAGCCAG7A CGACGGCC7G 
TyrLysGlu AlaLysGlu SerValAspAla LeuPheVal AsnSerGln 7yrAspArgLeu 

27 61 CAGGCCGACA CCAACA7CGC CA7GA7CCAC GZZGGZG7<GK AGCGCG7GCA CAGCA7TCGC 
GlnAlaAsp ihrAsnlle AiaMetll.eHis AlaAlaAsp LysArgVai HisSerlleArg 

9^. uAbowwiACv^ i bu^«^sjAb^ i. bAVj\«o^oAlU LwwvjvjuoibA Abvjv»w(jv«v,A« biiUbAoUAA 

GluAlaTyr LeuProGlu LeuSerVallie ProGlyVal AsnAlaAla IlePheGluGlu 

_ 2381 C7CGAGGGGC GCA7C77CAC GGCC77CAGC C7G7ACGACG CCCGCAACG7 GA7CAAGAAC 
\ LeuGiuGly ArgllePhe ThrAiaPheSer LeuTyrAsp AiaArgAsn VailleLysAsn 

2 941 GGCGACTTCA ACAACGGCC7 GAGCTGCTGG AACG7GAAGG GCGACG7GGA CG7GGAGGAG 
GlyAspPhe AsnAsnGly LeuSerCysTr? AsnValLys GlyHisVal AspValGluGlu 

3001 CAGAACAACC ACCGCAGCG7 GC7GG7GG7G CCCGAG7GGG AGGCCGAGG7 GAGCCAGGAG 
GinAsnAsn HisArgSer ValLeuValVai ProGluTrp GiuAlaGlu VaiSerGlnGlu 
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VaiArgVal CysProGly ArgGiyTyrlie LeuArgVai ThrAiaTyr LysGiuGiyTyr 

3121 GGCGAGGGCr GCGTGACCAT CCACGAGATC GAGAACAACA CCGACGAGCT CAAGTTCAGC 
GlyGluGly CysValThr IleHisGluIie GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGCGTGG AGGAGGAGGT GTACCCCAAC AACACCG7GA CCTGCAACGA CTACACCGCG 
AsnCysVal GluGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTyrThrAia 

3241 ACCCAGGAGG AGTACGAGGG CACCTACACC AGCCGCAACC GCGGCTACGA CGGCGCCTAC 
ThrGlnGlu GluTyrGlu GlyThrTyrThr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3 301 GAGAGCAACA GCAGCGTGCC CGCCGACTAC GCCAGCGCCT ACGAGGAGAA GGCCTACACC 
GluSerAsn SerSerVai ProAlaAspTyr AlaSerAia TyrGIuGlu LysAiaTyrThr 

3361 GACZZCZGCZ GCGACAACCC CTGCGAGAGC ?JkCZGCZGC1: ACGGCGACTA CACCCCCCTG 
AspGiyArg ArgAspAsn ProCysGluSer AsnArgGly TyrGlyAsp TyrThrProLeu 

3421 CCCGCCGGC? ACGTGACCAA GGAGCTGGAG 7ACTTCCCCG AGACCGACAA GGTGTGGATC 
ProAiaGiy TyrVaiThr LysGiuLeuGlu TyrPhePro GluThrAsp LysVaiTrpIle 

34 81 GAGATCGGCG AGACCGAGGG CACCTTCATC Q7GQ?iC?iGCQ TGGAGCTGCT GCTGATGGAG 

GluIleGiy GluThrGlu GlyThrPhelle VaiAspSer ValGluLeu LeuLeuMerGlu 

35 41 GAGTAG 
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: ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA- ACTGCCTGAG CAACCCCGA3 
MecAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlj 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCT3 
ValGluVal LeuGlyGiy GluArglleGlu ThrGlyTyr ThrProIle AspIleSerLeu 

121 AGCCTGACCC AGTTCCTGCT GAGCGAG7TC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerleuThr GlnPheLeu LeuSerGluPhe VaiProGly AlaGlyPhe ValLeuGiyLeu 

181 GTGGACATCA TCTGGGGCA7 CTTCGGCCCC AGCCAGTGGG ACGCCTTCC7 GGTGCAGATC 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG 7TCGCCCGCA ACCAGGCCA7 CAGCGGCC7G 
GluGlnLeu IleAsnGln ArglleGluGlu PheAlaArg AsnGlnAia IleSerArgLe- 

301 GAGGGCC7GA GCAACC7G7A CCAAA7C7AC GCCGAGAGC7 7CCGCGAG7G GGAGGCCGAC 
GluGlyLeu SerAsnLeu 7yrGlnIle7yr AlaGiuSer PheArgGlu 7rpGluAlaA£r 

3 61 CCCACCAACC CZGCCZTGCG CGAGGAGA7G CGCA7CCAG7 7CAACGACA7 GAACAGCGCZ 
?ro7hrAsn ProAlaLeu ArgGluGluMec ArglleGIn PheAsnAsp MecAsnSerAla 

421 C7GACCACCG CCA7CCCCC7 G77CGCCG7G CAGAAC7ACC AGG7GCCCC7 GC7GAGCG7G 
Leu7hr7hr AlallePro LeuPheAlaVal GlnAsn7yr GlnValPro LeuLeuSerVal 

481 TACG7GCAGG CGGCCAACC7 GCACC7GAGC G7GC7GCGCG ACG7CAGCG7 G77CGGCCA3 
TyrValGl- AlaAlaAsn LeuriisleuSer Vd.LeuArg AspValSer ValPheGlyGl" 

54 1 CGC7GGGGC7 7CGACGCCGC . CACGA7CAAC AGCCGC7ACA ACGACC7GAC CCGCG7GA7C 
Arg7rpGiy PheAspAla Aia7hrIieAsn SerArg7yr AsnAspLeu 7hrArgLeuIle 

501 GGCAAC7ACA CCGACCACGC CG7GCGC7GG 7ACAACACCG GCC7GGAGCG CG7G7GGGG7 
GlyAsn7yr 7hrAspHis AiaVaiArg7rp 7yrAsn7hr GlyLeuGlu ArgVai7rpGiy 

661 CCCGACAGCC GCGAC7GGA7 CAGG7ACAAC CAG77CCGCC GCGAGC7GAC CC7GACCG7G 
PrcAspSer ArgAsp7r? IleArg7yrAsn GlnPheArg ArgGluLeu 7hrLeu7hrVal 

721 C7GGACA7CG 7GAGCC7G77 CCCGAAC7AC GACAGCCGCA CC7ACCCGA7 CGGCACCG7G 
LeuAspIle ValSerLeu PheProAsn7yr AspSerArg 7hr7yrPro IleArg7hrVal 

781 AGCCAGC7GA CGCGCGAGA7 77ACACCAAC CCCG7GC7GG AGAACT7CGA CGGCAGC77C 
SerGlnLeu 7hrArgGlu Ile7yr7hrAsn ProValLeu GiuAsnPhe AspGlySerPhe 

^ 841 CGCGGCAGCG CCCAGGGCA7 CGAGGGZ?.GZ A7CCGCAGCC CCCACC7GA7 GGACA7CC7G 
ArgGlySer AlaGlnGiy lieGluGlySer IleArgSer ProHisLeu MetAspIleLeu 

901 AACAGCA7CA CCA7C7ACAC CGACGCCCAC CGCGGCGAG7 AC7ACTGGAG CGGCGACCAG 
AsnSerlle 7hr:ie7yr 7hrAspAlaHis ArgGlyGlu 7yr7yr7rp SerGiyHisGln 

961 A7CA7GGCCA GCCCCG7CGG C77CAGCGGC CCCGAG77CA CC77CCCCC7 G7ACGGCACC 
lie.MetAla SerProVai GlyPheSerGiy ProGluPhe 7hrPhePro Leu7yrGly7hr 
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1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCGAGGG AGTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAiaGin LeuGlyGln GlyValTyrAr? 

1081 ACCCTGAGCA GCACCCTGTA CCG7CGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGinGlr*Le*j 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA GCAACZ7GCZ CAGCGCCGTG 

SerValLeu AspGlyThr GluPheAiaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAAC\ACGTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

12 61 CCACCTCGAC AGGGCTTCAG CCACCG7CTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCG7GCA CCTATG7TCA GC7GGA77CA CCGCAG7GC: 
SerAsnSer SerVaiSer IlelieArgAia ProMecPhe SerTrpIle HisArgSerAla 

1381 GAGT7CAACA ACA7CA7CCC CAGCAGCCAG ATCACCCAGA TCCCCC7GAC CAAGAGCAC: 

GluPheAsn Asnllelle ProSerSerGln Ile7hrGln IleProLeu ThrLysSerThr 

1441 AACC7GGGCA GCGGCACCAG CG7GG7GAAG GGCCCCGGC7 7CACCGGCGG CGACA7CCTG 
AsnLeuGly SerGly7hr SerValValLys GlyProGly ?he7hrGly GlyAspIleLeu 

15C1 CGCCGCACCA GCCCCGGCCk GATCAGCACC C7GCGCG7GA ACA7CACCGC CCCCC7GAGC 
ArgArgry-.r SerPrrGly GlnlleSerThr leuArgVal AsnlleThr AlaPrcLeuSer 

15 51 CAGCGC7ACG GCG7CCGCA7 CCGC7ACGCC AGCACCACCA ACC7GCAG7T CCACACCAGC 
GinArg7yr ArgValArg IleArg7yrAla SerThr7hr AsnLeuGln PheHisThrSer 

1621 A7CGACGGCC GCCCGA7CAA GCAGGGCAAC TTCAGCGCCA CCA7GAGCAG CGGCAGCAAC 
lieAspGly ArgProIle AsnGlnGlyAsn PheSerAia ThrMetSer SerGlySerAsr. 

1681 CTGCAGAGCG GCAGC77CCG CACCG7GGGC TTCACCACCC CC77CAAC7T CAGCAACGGC 
LauGlnSer GlySerPhe Arg7hrVal.Gly ?heThr7hr ProPheAsn PheSerAsnGly 

1741 AGCAGCG7G7 7CACCC7GAG CGCCCACG7G T7CAACAGCG GCAACGAGG7 G7ACA7CGAC 
SerSerVal ?he7hrLeu SerAiaHisVal PheAsnSer GlyAsnGlu Val7yrIleAsp 

1801 CGCATCGAG7 7CG7GCGCGC CGAGG7GACC T7CGAGGCCG AG7ACGACC7 GGAGAGGGC7 
ArglieGlu PheValPro AlaGluValThr PheGluAia Glu7yrAsp LeuGluArgAla 

1861 CAGAAGGCCG 7GAACGAGC7 G77CACCAGC AGCAACCAGA 7CGGCCTGAA GACCGACG7G 
GlnLysAla ValAsnGlu LeuPhe7hrSer SerAsnGln IleGlyLeu LysThrAspVal 

1921 ACCGAC7ACC ACA7CGA7CA AG7A7CCAAT T7AG77GAG7 G777A7CTGA 7GAA7777G7 
7hrAsp7yr HisIleAsp GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 C7GGA7GAAA AAAAAGAAT7 G7CCGAGAAA G7CAAACA7G CGAAGCGAC7 7AG7GA7GAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLy sHis AlaLysArg LeuSerAspGlu 
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2041 CGGAATTTAC TTCAAGATCC AAAC7TTAGA GGGATCAATA GACAACTAGA CCG'TGGCTZG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArcGinLeu AspArgGlyTrp 

' 2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGT7 
ArgGlySer ThrAspIie ThrlleGlnGly GlyAspAsp VaiPheLys GluAsnTyrVal 

2161 ACGCTATTGG GTACCTTTGA TGAGTGCTAT CCAACGTATT TATATCAAAA AATAGATGAG 
ThrLauLeu GiyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIleAspGlu 

2221 TCGAAATTAA AAGCCTATAC CCGTTACCAA TTAAGAGGG7 A7ATCGAAGA 7AG7CAAGAC 
SerLysLeu LysAlaTyr 7hrArgTyrGin LeuArgGly 7yrIleGlu AspSerGinAsp 

2281 77AGAAA7C7 A777AA77CG C7ACAA7GCC AAACACGAAA CAG7AAA7G7 GCCAGG7ACG 
LeuGluIle 7yrLeuIle ArgTyrAsnAla LysHisGlu 7hrValAsn ValProGlyThr 

2341 GG77CC77A7 GGCCGC777C AGCCCCAAG7 CCAA7CGGCA AG7GCGGGGA GCCGAA7CGA 
GlySerLeu 7rpProLeu SerAlaProSer ProIleGly LysCysGly GluProAsnArg 

2 4C1 rGCGC7CCGC ACC7GGAG7G GAACCCGGAC C7AGAC7GCA GC7GCAGGGA CGGGGAGAAG 
CysAlaPro HisLeuGiu TrpAsnProAsp LeuAspCys SerCysArg AspGiyGluLys 

24 61 TGCGCZCkCC ACAGCCACCA C77CAGCC7G GACA7CGACG 7GGGC7GCAC CGACC7GAAC 
CysAiaHis HisSerHis HisPheSerLeu AspIleAsp ValGlyCys 7hrAspLe\iAsn 

I-luA^pLe-j GlyVal7rp ValllePhalys ::elys7hr GlnAspGiy HisAlaArgLe-.: 

2 5 21 GGCAATC7AG AA777C7CGA AGAGAAACCA 77AG7AGGAG AAGCAC7AGC 7CG7G7GAAA 
GlyAsnLeu GluPheLeu GIuGluLysPro LeuValGly GluAiaLeu AlaArgValLys 

2 541 AGAGCGGAGA AAAAA7GGAG AGACAAACG7 GAAAAA77GG AA7GGGAAAC AAA7A77GT7 
ArgAiaGlu LysLys7rp ArgAspLysArg GluLysLeu Giu7rpGlu 7hrAsnIleVal 

2 701 7A7AAAGAGG CAAAAGAA7C 7G7AGA7GC7 77A777G7AA AC7C7CAA7A 7GA7AGA77A 
7vrLvsGLu AlaLysGiu SerValAscAla LeuPheVal AsnSerGin 7vrAscAraLe\i 

27 61 CAAGCGGA7A CGAACA7CGC GA7GA77CA7 GCGGCAGA7A AACGCG77CA 7AGCA7TCGA 

GlnAlaAsp 7hrAsnIie AlaMetlleHis A:.aAlaAsp LysArgVal HisSerlleArg 

^0*7* ^ •% f ^^r^ -y ff^ ^^^^•^/••^j^ — m »^/^^» ^/-» > rr>r^ .mM<^/>«^^f^^m mrf<»w«M^^m/^>« 

: Gl'aAla7yr LeuProGiu LeuSerVaille ProGlyVal AsnAlaAla IlePheGluGlu 

2381 77AGAAGGGC G7A777TCAC 7GCA77C7CC C7A7A7GA7G CGAGAAA7G7 CA77AAAAAT 
LeuGluGly ArgllePhe 7hrAlaPhe5er Leu7yrAsp AlaArgAsn VallleLysAsn 

2 941 GG7GA7777A A7AA7GGC77 A7CC7GC7GG AACG7GAAAG GGCA7G7AGA 7G7AGAAGAA 
GlyAspPhe AsnAsnGly LeuSerCys7rp AsnValLys GlyHisVai AspValGluGlu 

3001 CAAAACAACC ACCG77CGG7 CC77G77G77 CCGGAA7GGG AAGCAGAAG7 G7CACAAGAA 
Glr.AsrJVsn HisArgSer ValLeuValVal ?roGiu7rp GluAlaGlu ValSerGlnGlu 
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FIG. 15 (CONT) 

c 

3061 GT7CGTG7CT GTCCGGGTCG TGGCTATA7C CT7CGTGTCA CAGCG7ACAA GGAGGGA7Ar 
VaiArgVai CysProGly ArgGlyTyrlle LeuArgVai ThrAiaTyr LysGIuGly7yr 

3121 GGAGAAGG7T GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAGTTTAGC 
GlyGluGiy CysValThr lieHisGluIle GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGTG7AG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTAATGA TTATAC7GC3 
AsnCysVal GluGluGlu VaiTyrProAsn AsnThrVal ThrCysAsn AspTyrThrAla 

32 41 ACTCAAGAAG AATATGAGGG TACGTACACT TCTCGTAATC GAGGATATGA CGGAGCCTAT 

ThrGlnGlu GluTyrGlu GlyThrTyrThr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3301 GAAAGCAATT CTTCTGTACC AGCTGATTAT GCATCAGCCT ATGAAGAAAA AGCATATACA 
GluSerAsn SerSerVal ProAlaAspTyr AlaSerAla TyrGluGlu LysAiaTyrThr 

3 3 61 GATGGACGAA GAGACAATCC TTGTGAA7CT AACAGAGGA7 A7GGGGA77A CACACCAC7A 
AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly TyrGlyAsp 7yr7hrProLe- 

34 21 CCAGC7GGC7 A7G7GACAAA AGAA77AGAG 7AC77CCCAG AAACCGA7AA GG7A7GGAr- 
ProAlaGly 7yrVai7hr LysGluLeuGlu 7yrPhePro Glu7hrAsp LysVal7rpIle 

34 81 GAGA7CGGAG AAACGGAAGG AACA7tCA7C G7GGACAGCG 7GGAA77AC7 7C77A7GGAG 
GluIleGly Glu7hrGlu Gly7hrPheIle ValAspSer ValGluLeu LeuLeuMetGlu 

3541 GAA7 AAG 
" , - 



54/94 



wo 93/07278 PCT/US92/08476 



TIC. 16 




U3 ^3 



55/94 



wo 93/07278 



/US92/08476 



FIG. 1 



5232,NcoI 
4979,StuI 
4811,XbaI 
4805,Sair 
4799,Pstr 
4793,SphI . 
4787,HindIII ■ 



5808,BamHI 
5666,EcoRV 



EcoN 1,375 
AatII,520 




Ball,965 
NcoU019 
HincII,1103 
Sall,1103 



Apal,1470 

Pstl,1603 
Pstl,168l 

BstEII,1824 

Glal,1933 
BgiII,1955 
Clal,2042 
Kpnl,2146 
£coRL2152 
Ndel,2365 
Aatll,2617 
Sspl,2733 
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PGR fragment replacing Pith Nco I (ATG start) 

with Bam HI, Pith-5^-GGATCCAACA(ATG)-3^-svn.-Bt 




7273,NruI 



6678.HindIlI 
6641. PstI 



5369.BamHI 



6232,XbaI 
6226.SalI 
6220,Pstl 
208,HindIII 



?stl,1603 
PstLl681 

BstEILl824 



35S 3' 



pCIB4420 

8319 bps 



PepC Ivs«9 
3SS 3' 



5730,SalI 




EcoRI,21S2 
N'deI,236S 
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rIG. 23A 

CrvlA(b) Protein Levels in Transgenic Maizg 

ELISA 3t Values o< Reld Plants: . 

INBRED X PARENT ABRU PLANT Number ng BVmg protein 





1 AAA 




^NQfl4,y 1 71 -AA 


fl^7 


1 / 05 


wI>i30*»A 1 i 1 "*»M 


0 /U 


1 /60 


ONyO^'A 1 / i-io 




22 




1 a-oB 


17 


5N984X171-15 


1470 


28 


5N984X171-14A 


1502 


180 


SN984X171-UA 


1529 


1500 


5N984X1 76-11 


1667 


408 


5N984X 176-11 


1671 


1270 


5N984X176-11 


1573 


1522 


5N984X1 76-11 


1675 


943 


5N984X 176-1 1 


1679 


967 


5N984X171-4B 


1942 


15 


5N984X171-4B 


1946 


16 


5NA56X171-16A8X 


1101 


30 


5NA89X1 76-11 


1622 


959 


5NA89X176-11 


1630 


1172 


5NA89X176-11 


1635 


1 100 


6r0l0X17l.4 


825 


103 


6r010Xl7l-4 


832 


1298 



■Bt levels are in ng crylA{b).mg total protein 

m;^iz? tr:^n*ifornnants exoressina the crvlAibl 

protein. 

-ELISA anaJysis of transgenic plant material was c;^rned out using standard procedures 
as described elsewhere. 
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FIG. 23B 




63/94 



wo 93/07278 



/US92/08476 



FIG. 23C 

CrvlA(b) Protein LevQls in Transgenic Maize 
Greenhouse plants 



35S LINE 


LEAF 


PITH 


ROOT 


POLLf 


5F010 X 171-4A 


-409 ^ 288 


NT 


NT 


NT 


5N984 X 171-14A 


256 + 159 


191 


198 


30 


6F010 X 171-16AB 


240 -^174 


221 


271 


NT 


5N984 X 171-13 


201 + 94 


NT 


NT 


NT 


5NA89X 171-13 


37 -K 7 


150 


0 


NT 


5N984 X 171-18 


7.7 -t- 3 


NT 


NT 


NT 


6N615 X 171-16AB 


7.5 + 3 


0 


0 




PE?C LINE 










6N615 X 176-11 


1125 419 


41 


19 


NT 


5F010 X 176-10 


774 ^ 1 59 


NT 


NT 


130 


5N984 X 175-1 1 


7 1 9 - 1 28 


16 


20 


186 



-Bt levels are in ng crylA(b)/mg total protein 

Data are from progeny of the described maize transformants expressing the crylAfb) 
protein. 

ELISA analysis ot transgenic plant material was earned out using standard procedures 
as aesrnbed elsewhere. 
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FIG. 23D 
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FIG. 



23E 



EXPRESSION OF THE CaYIA(b) GENE IN TRANSGENIC MAIZE USING THE PITH- 
PREFERRED PROMOTER 

Leaf samples from small piantiets transtomned with DCiB4433 using procedures 
described elsewhere were analyzed for the presence of the crylA(b) protein using 
ELISA. All plants expressing crylA(b) were founa to be insecticidai in the standard 
European com borer bioassay. 

Note that the pith -preferred promoter has a low. but detectable level of expression in 
leaf tissue of maize. Detection of CfyiA(b) protein i? consistent with this pattern of 
expression. 



PLANT NUMBER 



ng cylA(b)/mg protein 



JS21A-1 TOP 
JS21A-2 TOP 
JS21A-3 TOP 
JS2lA-n TOP 
JS21A-12 TOP 
JS21 A-13 TOP 
JS21A-14 TOP 
JS21A-19TOP 
JS21A.24 TOP 
JS21A-28 TOP 
JS22D-3 MID 
JS22D-4 MID 

J522D-1 ^ MtD 
JS22D-17 MID 



113 

127 

112 

9T 

118 

82 

0 



169 
0 



154 

2946 

5590 



?00-i 
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rIG. 24 

1 GAATTCGGATCCATTAAAGAAGTCrTTGAACAGATTCTAGAGATCTAGTTrAATGAGCTC 60 

6 1 CCAAAAGTCTTGAAAAAATTCAGCGGGGAGGCCATTAGGGCAGGGGTACTGTTATGTTT? 120 

' 121 AAAGAGAACACCACTTTCTTGATCTCTTCTAAAGAGAAATGTTTTGTAAGAAGGATCCTG 180 

181 TCCTCCTCATCCAACCTTTTCATCGGCAAATTTTT CATAGAGATATTAGAGGCAAGAGAG 240 

241 GGGCCAAAAAGATCCATGTAAATGGAAGTGGCCACCTGGTTGATACCTCCCTCATCTTCA 300 

301 ACAGAAAATCCATTATGAAAAAGTGAATGGATTTTAAACTCTTCTTTTTCTTCCCTTTTG 360 

3 61 CAATGAGCTGAAAATATCTGGTATTATTCTCATCACCCTCATTAATGAATCTGTCCCTAG 420 
421 CAATTTGCTTTCTCTTGATCCCTTCTGCAGCCACCATGTTTCTTAAATTCCACTCCATAT 480 

4 81 CAAGCTTTTCCAATCTATCAGAATCTGAGATGGCTGCAATCTCTCTCATTTTCTCAAGGA 540 
541 TATCGATGTTATCCATAAGGTATTTCTTGAACTTCTTATATTTCCCTTCGACATTTATAT 600 
601 TCCATCCTTTCAACATTTTTTTGTTCAATCTTTTTTGTTTTTTTCCr TTCCAAACATCGA 660 
661 TACATTTCCTGCTCCTCACAGGTAAGGACGAGCTTTCAAAAAACCTTCTGCTTTAAAGTC 720 
7 21 AGGTCTGAGCCTCCAGCAAAGCTCACATATCTAAAGTCCCTCTTCTTAGTTGGGACAGAG 780 
781 TCAGTGCTAAGACACATGGGAACATGACCAGAAAAAAAAAATCATATTTAGCCCAGAGAC 840 
841 AACAATATTCTTGTACTGCAAGTCTCGTTATGGGCTAGCAAAGGAATCTACCCAACTTCT 900 
901 CAAATGTGTTGGGATGTCAAGTATATAGACTATTCATCAGTTCCAACTCTATCAAACTGT 960 
961 GCAGCTCAATTATAGAGTTGAATAAAGTGCTCCATCTATTTGTTCTTATCCTCATATTTG 1020 

1021 GTTAAGATATTAAAATCACCTCCCACCAACATTTAAAGTGCACCATTTAAAGTGGCTCGC 108C 

1081 GAGCACCAAACCGCTGAAAACCGGAAATGTTTAGCACG7TGGCAGCGGGACCCTTTTCTA 1140 

1141 TCTCATCGTGTTCTTCGTTGTCCACCACGGCCCACGGGCCAACGCTCCTCCATCCTGTAG 1200 

1201 TGTAGAGTATATTCCATTTGCGACCGAGCCGAGCATCGATCCAGCCACACTGGCCACTGC 1260 

84 

1261 CAGCCAGCCATGTGGGACTCCTACGTATACTACGTGA GGTGAgAT?^^ 1320 

-465 -405 

1321 GGGACCGAGATATTTTACTGCTGTGGTTGTGTGAGAGATAATAAAGCATTTATGACGATT 138C 

1381 GCTGAACAGCACACACCATGCGTCCAGATAGAGAAAGCTTTCTCrCTTTATTCGCATGCA 1440 

1441 TGTTTCATTATCTTTTATCATATA'TATATAACACATATTAAATGATTCTTCGTT CCAAT T 1500 

-285 -226 

1501 TATAATTCATTTGACTTTrTTATCCACCGATGCTCGTTTTATTAAAAAAAATATTArAAT 15 60 

-225 -166 

15 61 TATTGTTACrTTTTGTTGTAATATTGTTTAGC ATATAATA AACTTTGATACTAGTATGTT 1620 

-165 -106 

49 

1621 TCCGAGCAAAAAAAAATATTAArATTTAGATTACGA GCCCATTI^TTAarr!^ 1680 
-105 -46 

33 +1 

1681 ACJU^GCGAAGCAAAGCAAAGCAA GCrSAATGTrGCC^ 1740 
-45 +15 

73******* 

1741 CTTGCTATAAACGAGGCAGCTAGACGCGACTCGACTCArCAGCCTCATCAA CCTCGaC^ 1800 
+16 +75 

1301 AGGAGGaACGAACSGA CAGGTTGTTGCACAGAAGCSACATGGCTTTCGCSCCCAAAACGT 1860 

•^76 MATAPKTS +135 

3S 
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FIG. 24 



18 61 CCTCCTCCTCCTCGCTGTCCr rCGGCGTTGCAGGCAGCrCAGTCGCCGCCGCTGCTCCTGA 1922 

+ 126 SSSSLSSALQAAQSPPLLLR -f-19£ 

.40 + 41 

1921 GGCGGATGTCGTCGACCGCAACACCGAGACGGAGGTACGACGC GGCCGTCGTCGTCACTX 198: 

t196 RMS STAT? RRRYDAAVVVT T +25 = 

1981 CCACCACCACTGCTAGAGCTGCGGCGGCTGCTGTCACGGTTCCCGCCGCCCCGCCGCAGG 2 0 4 C 

+256 TTTARAAAAAVTVPAAP P QA +315 

75 $ 

2041 CGGGCCGCCGCCGCCGGTGCCACCAAAGCAAGCGGC GGCACCCtKafiAfWAffGaGCC ^ 2100 

+316 GRRRRCH Q S KRRHP QRRS R? -1-375 

2101 CGGTGTCGGACACCATGGCGGCGCTCATGGCCAAGGGCAAGGTTCGTATAGTACGCGCGC 21 60 
+376 VSDTMAALMAKGK 

2161 GTGTCGTCGTCGTTATTTTGCGCATAGGCGCGGACATACACGTGCTTTAGCTAGCTAACA 222 ' 

2221 GCTAGATCATCGGTGCAGACGGCGTTCATCCCGTACATCACCGCCGGCGACCCGGACCTA 228 : 

TAFIPYITAGDPDL 

22 81 GCGACGACGGCCGAGGCGCTGCGTCTGCTGGACGGCTGTGGCGCCGACGTCATCGAGCTG. 234 2 
ATTASALRLLDGCGADVIEL 

2341 GGGGTACCCTGCTCGGACCCCTACATCGACGGGCCCATCATCCAGGCGTCGGTGGCGCGG 2400 
GVPCSDPY IDGP I IQASVAR 

24 01 GCTCTGGCCAGCGGCACCACCATGGACGCCGTGCTGGAGATGCTGAGGGAGGTGACGCCG 24 6: 
ALASGTTMDAVLEMLRSVT? 

24 61 GAGCTGTCGTGCCCCGTGGTGCTCCrCTCCTACTACAAGCCCATCATGTCTCGCAGCT.TG 252 ! 

SLSCPVVLLSYYK? IMSRSL 

2521 GCCGAGATGAAAGAGGCGGGGGTCCACGG7AACTATAGCTAGCTCTTCCGATCCCCCTTC 2 5 8 
AEMKSAGVH 

25 81 AA.TTAATTAATTTATAGTAGTCCATTCATGTGATGATTTTTGTTTTTCTTTTTACTGACA 264 0 
2 641 GGTCTTATAGTGCCTGATCTCCCGTACGTGGCCGCGCACTCGCTGTGGAGTGAAGCCAAG 2700 

GLIVPDLPYVAAHSLWSEAK 

27 01 AACAACAACCTGGAGCTGGTAGGTTGAATTAAGTTGATGCATGTGATGATTTATGTAGCT 2760 
N N N L E L 

27 61 AGATCGAGCTAGCTATAATTAGGAGCATATCAGGTGCTGCTGACAACACCAGCCATACCA 2820 

VLLTTPAIP 

2821 GAAGACAGGATGAAGGAGATCACCAAGGCTTCAGAAGGCTTCGTCTACCTGGTAGTTATA 2880 
EDRM-KS ITKASSGFVYL 

2 881 TGTATATATAGATGGACGACGTAACTCATTCCAGCCCCATGCATATATGGAGGCTTCAAT 2940 
2941 TCTGCAGAGACGACGAAGACCACGACGACGACTAACACTAGCTAGGGGCGTACGTTGCAG 3000 

3 001 GTGAGCGTGAACGGAGTGACAGGTCCTCGCGCAAACGTGAACCCACGAGTGGAGTCACTC 3060 

VSVNGVTGPRANVNPRVSSL 
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3061 ATCCAGGAGGTTAAGAAGG7GACTAACAAGCCCGTTGCTGTTGGCTTCGGCATATCCAAG 3120 
IQSVKKVTNKPVAVGrGISK 

3121 CCCGAGCACGTGAAGCAGGTACGTACGTAGCTGACCAAAAAAAACTGTTAACAAGTTTTG 318 0 

? E H V K ■ 

3181 TTTGACAAGCCGGCTACTAGCTAGCTAACAGTGATCAGTGACACACACACACACACAGAT 3240 

Q I 

32 41 TGCGCAGTGGGGCGCTGACGGGGTGATCATCGGCAGCGCCATGGTGAGGCAGCTGGGCGA 3300 
AQWGADGVI IGSAMVRQLGS 

3301 AGCGGCTTCTCCCAAGCAAGGCCTGAGGAGGCTGGAGGAGTATGCCAGGGGCATGAAGAA 33 60 
AAS PKQGLRRLSE YARGMKN 

3361 CGCGCTGCCATGAGTCCATGACAAAG7AAAACGTACAGAGACACTTGATAATATCTATCT 3420 
ALP 

3421 ATCATCTCGGAGAAGACGACCGACCAATAAAAATAAGCCAAG7GGAAGTGAAGCTTAGCT 34 80 

34 81 GTATATACACCGTACGTCGTCGTCGTCGTTCCGGATCGATCTCGGCCGGCTAGCTAGCAG 3540 

3 5 41 AACGTGTACGTAG7AG7ATGTAATGCATGGAGTGTGGAGCTACTAGCTAGCTGGCCGT7C 3 6 0 C 

3 601 ATTCGA7TA7AA7TC77CGC7C7GC7G7GG7AGCAGA7G7ACC7AG7CGA7C77G7ACGA 3 6 6 C 

3661 CGAAGAAGCTGGC7AGC7AGCCG7C7CGATCG7ATA7G7ACTGA7TAATC7GC:AGATTGA 3720 

$ 

3721 ATAAA AACTACAG7ACGCA7A7GA7GCG7ACG7ACG7GTG7A7AG7TTG7GC7CATATA7 3780 

37 81 GC7CCTCA7CACC7GCC7GA7C7GCCCA7CGA7C7C7CTCG7AC7CC77CCTG77AAATG 3840 

38 41 CC77C777GACAGACACACCACCACCAGCAGCAG7GACGC7CTGCACGCCGCCGCTT7AA 3900 
3 901 GACATG7AAGA7A77T7AAGAGG7A7AAGA7ACCAAGGAGCACAAA7C7GGAGCAC7GGG 396: 

3 9 61 A7A77GCAAAGACAAAAAAAAAACAAAA77AAAGTCCCACCAAAG7AGAGATAG7AA^ 4020 

4 021 GG7GGATGGA77AAAA77A7C7CA7GA77777GGA7C7GC7CAAA7AGA7CGA7ATGG7A 4 080 
4 081 7TCAGA7C7A7G77G7A7AGCC77T7CA77AGC7TTC7GAAAAAAAAA7GGTA7GA7GAG 414 0 
4141 7GCGGAG7AGC7AGGGC7G7GAAGGAG7CGGArGGGC77CCACG7AC77G7T7G7GGCCG 4200 
4 201 TAG7CCGG77CTA777AGG7CCGA7CCGAG7CCGGCA7GG7CCGG77CCA7ACGGGC7AG 4260 
42 61 GACCAAGCTCGGCACGTGAG7777AGGCCCG7C3GC7AGCCCGAGCACGACCCG77777A 4.320 
4 321 AAC7GGC7AGGACTCGCCCA777AA7AAGACAAACATTGCAAAAAATAGC7C7A7T777T 4380 
4 381 A7T7AAAA7A7A7TG777A777G7GAAA7G7G7ATTA7T7G7AA7A7A7A77A77G7A7A 4440 
4 441 7AG77A7ATC7TCAA7TA7GA777A7AAA7A7GTTT77TA77A7GAAC7CAA7T77AAG7 4500 
4 501 77GA777A7GCG7TGGCGGGC7CGAGGAGGCACGG7GAACA77777GGG7CGGGC77AAC 4560 
4 561 GGG7CGGCCCGGCCCGG77CGGCCCA7CCACGGCCCA7CCCG7G7CGGCC7CG7TCGG7G 4 62 0 
4 621 AG77CAGCCCG7CGGACAACCCG7CCCCGGCCCGGA7AA77AA7CGGGCC7AACCG7GGC 4 680 
4 681 G7GC77AAACGG7CCG7GCC7CAACGGACCGGGCCGCGGGCGGCCCG7T7GACATC7C7A 4740 
4 7 41 G7GG7G7GATTAGAGA7GGCGA7GGGAACCGA7CAC7GA77CCG7G7GGAGAA7TCGA7A 4 800 
4 801 7CAAGC77A7CGA7ACC 4 817 

Figure 24. 

Entire sequence of the maize 7rpA gene, with introna 

and exona, transcription and translation strata, start and 

srop of cONA. 

$ = start and end of cDNA; -1 = transcriot ion start; 73******* = 
primer extension primer; i = srar- of translation; ^-M• = srop 

codon; « CCAA7 Box, TA7AA 3ox, poly A addition site." 

# above underlined secuences are ?CR crimers . 
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21 ID 5N984 

39d 46d 
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Norrhern bior showing dif f erer.rial expression of 
TrpA gene in mai-e tissues. 2 hour exposure against 
f-im at -80C with lupcnt Crcnex intensifying screens. 
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2110 5N98^ 

P C 3R £S LP MP UP C BR £S LP MP UP 




211D 5N984 

I 

39d 46d 



I il i 

S L R P SH LP UP LP MP UP 







Northern bier shewing differential expression of 
TrpA gene in maize tissues. 4 hour exposure against 
film at -aOC with Duoont Cronex intensifvinc screens. 
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211D 
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21 ID 



5N984 



39d 46d 



1 I i 

S L R P SH LP UP LP MP UP 




OTwiiern bioc showing dif f erenc-ai expression of 
rpA gene in maize tissues. 13 hour exposure againsr 
ilm at -eOC with Duponc Cronex intensifying screens. 
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FIG. 25D 
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211D 5N984 
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2110 5N984 21 ID 

L P L P 'si S2 S3 S4'S5 




Norrhern clcz show:.ng maize TrpA gene expression in 
Funk lines ZllD and 5N984 leaf and pith and zhe absen 
of expression in 211D seed ^oral RNA. 

65 hour exposure aga:.nsT: filn at-3CC wirh Dupont Cron 
intensif vina screens . 

4 — 
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10X 5X IX S H EV E B 



9.4- • O 

4^- 



2.0- 



1.2— €^ O 



Genomic southern cf r-jnJc line 211D prab'^d w<t'' -.'i- 
TrpA cDNA 8-2. 3 = 3amHI , Z = EcoRI, Ev'= EcoRv" ' 
H = Hindlll and S = Sad. 

120 hour exposure againsr fil.n at-80C with Ducon- --one 
xntensifvina screens. 
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Primer exrenaion showing the transcription start of TrpA 
gene sind sequencing ladder. 

1 hour exposure against film at-80C with Dupont Cronex 
intensifying screens . 
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FIG. 28B 



42^*0 



48 



12 3 12 3 




54^C 




-309 




RNase protecricn cf recicn from -^2 bp to -33* bo 
with three annealing temoerures. 

15 hour exposure against' film at-50C with Dunont Cronex 
inrensif vine screens . 
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Maize Pollen CDPK cDNA sequence 

sequence contained in clones pCIB3168 and pCiB3l69 



1 TG CAG ATC ATG CAC CAC CTC TCC GGC CAG CCC .^C GTG OrTG GGC CTC CGC -GGC -^CG 
l^Gln Me Ntet His His Leu Ser Gl y Gin Pro Asn Val Val GJ y Lau A rg Gl y Ala" 

57 TAC GAG GAC AAG CAG AGC GTG CAC CTC GTC ATG GAG CTG TGC GCG GGC GGG GAG CTC 
IP^Tyr Glu Asp Lys Gin Sar Val His Leu Val Mel Gl u Lau Cys Ala Giy Gl y Gl u Leu 

Aval 



114 TTC 


GAC CGC 


ATC 


ATC 


GCC 


CGG 




CAG 


TAC 


ACG 


GAG 


CGC 


GGC 




/^^^ 


GAG 


CTG CTG 


33> Phe 


Asp Arg 


1 1 a 


t le 


Al a 


A rg 


Giy 


Gin 


Tyr 


Thr 


Glu 


A rg 


Gly 


A( a 


At a 


Glu 


Leu Lau 






GTG 


CAG 


ATC 


GTG 


CAC 


ACC 




— > 


TCC 


ATG 


GGG 




ATG 


CAC 


S.GG GAC 


57 ► A r g 


Ala lie 


Val 


Gl n 


1 t e 


Vai 


Hi s 


Thr 


Cys 


Hi s 


Ser 


Mat 


Gly 


Vai 


Met 


Hi s 


A rg Asp 



Aval 

223 ATC AAG CCC GAG AAC TTC CTG C?G CTC AGC AAG GAC GAG -GAC GCG CCG CTC .^G GCC 

'•f^lie Lys Pro Glu Asn Phe Lau Leu Leu Ser uys Asp Glu Asp Ala Pro Lau Lys Aia 

295 ACC GAC TTC GGC CTC TCC GTC TTC TTC AAG GAG GGC GAG CTG CTC AGG GAC ATC 7X 

95>Thr Asp P^se Gly Leu Sar Vai Phe Phe Lys Gi u Gly Gi u Leu Lau Arg Asp Me Val 



Aval 



342 GGC 








TAC 


ATC 




CCC 


GAG 


vji, o 


CTC 


AAG 




AAG 






CCG 




CCC 


114>G1 V 


Ser 


Al a 


Tyr 


Tyr 


1 ! e 


Ala 


Pro 


Gl u 


Val 


Leu 


tys 


A rg 


Lys 


T V r 


GI V 


Pro 


Gl u 


Al a 


2r9 GAC 


ATC 






GTC 


GGC 


GTC 


A7^ 


CTC 


<Y>« — 


* 1- 


TTC 


CTC 






'jPG 


Cc ^ 


CCC 


TTC 


:33^Aso 


i 1 6 


T rp 


Ser 


Val 


Gly 


Val 


Met 


Leu 


Tyr 


1 1 e 


Phe 


Leu 


Al a 


Gl y 


Val 


Pro 


Pro 


Phe 


456 TGG 




GAG 


AAC 


GAG 


AAC 




ATC 








ATC 


CTG 




/-»»^*-* 


CAG 








151 ► T f p 


Al a 


Glu 


Asn 


GI u 


Asn 


Gly 


{ 1 e 


Phe 


Thr 


Al a 


1 1 e 


Leu 


A rg 


GI y 


Gl n 


Lau 


Asp 


Lau 


? 13 TC* 


AGC 


'GAG 


CCA 






CAC 


ATC 


TCG 




•GGA 




AAG 


GAT 


CTC 




AAG 




ATG 


:71>Ser 


Ser 


Gi u 


Pro 


Trp 


Pro 


Hi s 


1 1 e 


Ser 


Pro 


Gly 


Al a 


Lys 


Asp 


Lau 


Vai 


Lys 


Lys 


l^tot 


570 CTC 


AAC 


ATC 


AAC 




AAG 


GAG 


CGG 


CTC 


ACG 




TTC 


CAG 


GTC 


CTC 


.^AT 


CAC 


'CC-i 


TGG 


190^L9u 


Asn 


1 1 e 


Asn 


Pro 


Lys 


Gl u 


A rg 


Leu 


Thf 


Al a 


Phe 


Gl n 


Val 


Lau 


Asn 


His 


Pro 


Trp 


627 ATC 


AAA 


GAA 


GAC 




GAC 


GCG 


CCT 


GAC 


ACG 




CTT 


GAC 


AAC 


GTT 


GCT 


CTC 




AGG 


209> 1 f e 


i-ys 


GI u 


Asp 


Gly 


Asp 


Al a 


Pro 


Asp 


Thr 


Pro 


Leu 


Asp 


Asn 


Vai 


Vai 


Leu 


Asp 


A rg 


634 CTC 


AAG 




TTC 


AGG 


GCC 


ATG 


AAC 


CAG 


TTC 


AAG 


AAA 


GCA 


GCA 


CTG 


AGG 


ATC 


ATA 


GCT 


229^Lau 


Lys 


Gi n 


Phe 


A rg 


Al a 


Mat 


Asn 


G n 


Phe 


Lys 


Lys 


Al a 


Al a 


Lau 


A rg 


Me 


i 1 e 


Al a 


741 GGG 


TGC 


w A A 


TCC 


GAA 


GAG 


GAG 


ATC 


ACA 




CTG 


AAG 


GAG 


ATG 




AAG 


AAC 


ATT 


GAC 


247^GI y 


Cys 


Leu 


Ser 


Gl u 


Gl u 


GI u 


1 i e 


Thr 


Giy 


Leu 


Lys 


Gl u 


Met 




Lys 


Asn 


1 ! a 


Asp 


798 AAG 


GAT 


AAC 


AGC 




ACC 


ATT 


ACC 


^T*C 


GAC 


GAG 


CTC 


AAA 


CAC 




TTG 


GCA 


AAG 


CAC 


266^Lys 


ASD 


Asn 


Ser 


Gly 


Thr 


1 le 


Thr 


Lau 


Asp 


Gt u 


Lau 


Lys 


His 


GI y 


Leu 


Ala 


Lys 


His 


955 GGG 


CCC 


AAG 


CTG 


TCA 


GAC 


AGC 


GAA 


ATG 




.^AA 


A A 


ATG 


GAA 


GCA 


gct' 


GAC 


GCT 


GAC 


285 ►Gly 


Pro 


Lys 


Leu 


Ser 


Asp 


Sar 


GI -J 




G! u 




Lau 


^^t 


Gl u 


Al a 


Al a 


Asp 


At a 


Asp 


















EcoRt 




















912 GGC 


AAC 


GGG 


TTA 


ATT 


GAC 


TAC 


GAC 


GAA 


T^PC 


GTC 


ACC 


GCA 


ACA 


GTG 


C^iT 


Ax \J 


.2iAC 


AAA 


304 ►Giy 


Asn 


Gly 


Leu 


t 1 e 


Asp 


Tyr 


Asp 


Gl u 


Phe 


Val 


Thr 


At a 


Thr 


Va! 


His 


Mat 


Asn 


Lys 
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969 CTZ 


GAT 


AGA 


GAA 


G>\G 


CAC 


CTT 


TAC 


ACA 


GCA 




CAG 


TAT 




GAC 


.^AG 


GAC 


AAC 


AGC 


*m ! ail 


Asp 


A rg 


Vdl u 


\M U 


Ui c 
m s 


1 <k 1 f 


Tvr 
1 yr 


1 1 1 1 


A 1 A 

^ 1 a 


t 1 • o 






Phe 


Asp 


Lv s 




A c n 

4k ft 




1026 GGG 


TAC 


ATT 


ACT 


AAA 


GAA 


GAG 


CTT 


GAG 


CAC 




TTG 


AAG 


GAG 


CAA 




TTG 


TAT 


GAC 


r ui y 


To* 

ryr 


1 1 8 


1 nr 


Lys 




fii II 

\M U 




CX\ II 


Mi c 


A 1 a 


1 oil 
LOU 


i_y 5 


Gi u 


Gl n 


vji y 


1 Oil 
L O U 


I yr 




1083 GCC 


GAT 


AAA 


ATC 


AAA 


GAC 


ATC 


ATC 


TCC 


GAT 




GAC 


TCT 


GAC. AAT 


GAT 


GGA 


AGG 


ATA 


361^Ata 


Asp 


Lys 


1 1 e 


Lys 


Asp 


1 1 a 


1 la 


Sar 


Asp 


Al a 


Asp 


Sar 


Asp 


Asn 


Asp 


GJy 


A rg 


11a 


1140 ZkT 




TCA 


GAG 


TTT 


GTG 


GCG 


ATG 


ATG 


AGG 


AAA 


GGG 


ACG 


GCT 


GGT 




GAG 




ATG 


38C^Asp 


Tyr 


Sar 


Gl u 


Ptie 


Val 


Al a 


Mat 


Mat 


A rg 


Lys 


Gly 


Thr 


Ai a 


Gly 


Al a 


Glu 


Pro 


Mat 


1197 AAC 


ATC 


AAC 


PAG 


AGG 


CGA 


GAC 


ATA 


GTC 


CTA 


TAG 


TGAAGTGAAGCAGCAAGTGTGTAATGTAATGTG 


299>Asn 


1 la 


Lys 


Lys 


Arg 


Arg 


Asp 


1 ] 8 


Val 


Lau 


• • • 



















12 52 TArAGC-.GCTCAAACV^CCAAATTTGTACATCTGTACAC^^^ 



1340 AAAAAAAAAA 



80/94 



wo 93/07278 
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inner leaf sheath 

inner leaf whorl 

green leaf 
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pollen 

silk 

kernel 

pith 

root 
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FIG. jZ 

liDman*Pearson Protdin AJignmertt 
Gap Penalty: 2, Gap Langth Penalty: 12 

Saqi Seq2 Stmilanty Gap Gap Consensus 
ooi CDPK ptn raloic2 ptn Index Numtaer Length Length 

1>5S1 1>S28 36J 4 4 297 

Doi CDPK ptn YSMGKELGRGaFGVTHLCTHRTSGEKLACKTUiCRlCLAAREDVDDVRREVQIMHHLSGQPNVVGL^lGAYE 162 

^ : ELG: G. F: V. : C. . : TS. : A ic. I KL: AR: : : . :• RE. : i : • • PN* V = • • • 

-ax pr2 ptn rOLfEELGkGAFSVVRRCVKXTSTCEYAAKI If^TKKLSAROH-GKLEREARICRLLX-HPNIVRrHbisiS 8) 

pol CDPK ptn :kQSVHLVMELCAGGELFDRI lARGGYTERGAAELLRAIVQI VHTCHSMGVMHRDIKPENFLLLSKDEDA 232 

LV. : L '. GCELFr \i AR. Y: E : A; : I* * V* H * ' * HRO' KPEN' LL SK ' A 

-•at pk2 ptn EEG?HYLV*FbLVTGGELi='ED IVAREYY'sEAOAsk I HQ i LeSVNH I HQHo'l V 15' 

poi COPK ptn PLKATDFGLSVFFKEGELLR-OIVGSAYYUPEVLKRlC-VGPEAOIWSYGVMLYfrLAGVPPrVAENEVG 300 

: : K : OFGL: : : : : . G: : Y: : PEVL: : . YC . DIW: GV: LY 1; L. G PPFW. E: : : 

-at pk2 ptn AVKLA0FGLAIEYOGEQQAWFGFAGTPGYLSPEVLRKDPYG)CPVDIWACCV!LYILLVCYPPFVOEDQH< 22* 

Dot CDPK ptn IFTAILRGaLDLSSEPWPHISPGAKDLVKKMLNlNPKERLTAFQVLNHPVlKEDGDAPOTPLDNVVLDRL 370 

■ I G. 0; : S W : : P. AiC: L; : : H i NP R: TA Q L; HPW: : : ■ i 

'•at- 3^2 Ptn -YQQ|)CAGAYDFPSPEWOTVTPEA<NLINQML''!^PAKRITAOCALKHPWVCGRSTVASMMHRQETVECL 29' 

pa ZZPK Dtn <CFRAMNCr KKAALR I I 387 

: : r . A. . : : K A L . : 

-=t p<2 p:- ^<">;arr<^<ga:."'** 308 
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FIG- 33 

Lipman-Pearson Protein Altgnment 
Gap Penalty: 2. Gap Length Penalty. 12 

Seql Seq2 Simlartty Gap Gap Consensus 
poi CDPK Dtn humcama ptn Index Number Lengtn Len c^ 

l>551 1>150 40^3 2 2 142 

poi CDPK oxr LSEEEITCDCEMFKNIOlCDNSCTITLOELlCHGLAKHGPiCLSDSEMEKLMEAAOAOCNCLIOYOEr VTATV usC 

',;E£;1: : <E • :DICO. .GTIT EL . G. : ::: E: OAOGNG 1 D: EF: T . 

nwmcomo Ptn lTEEQ I AEFKEAf SLFOlCOGOGT I TT<ELGTVMRSLGQNPTEAELQO« I NEYOAOGNGT I orPEFLTHMA 74 

DOt C:PK H-rtNKLDREEHLYTAFQYFOKONSCYlTKEELEHALKECC-LYOAOKUOI ISOADSDNOCRIDYSEFVA 528 

: M: , 0. EE: : • AF: FDKO. ; GY I; . £L H. : . : C K : AO 0. DC; :: Y, EFV 

humcama ptn RICflKDTOSEEE I REAFRVFOKOCNCY I SAAELRHVMTNLCEKLTDEEVOEM I READ I DGOGQ VNYEEFVG 1 aa 

poi CDPK d:'> mm 530 

MM 
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FIG.. 34 

Lioman-Pearson Protein Alignment 
Gap Penalty: 2. Gap Length Penalty: 12 

Seql Soq2 Simiartty Gap Gap Consensus 
poiCDPK otn soybean CDPK ptn Index Number Length Length 

1>551 1>509 S2A ] ] 464 

Doi CORK otn VLGRPMEDVRATYSMC)(ELGaGQFGVTHLCTHRTSGE<LAC)CTUKRKLAAREDVOOVRREYCIMHHLSG '50 

II i • • I • • i • i I : I ) It : I : M : i : I ! : I I I I : t : M I I : : : I I : I 1 : ! I : : ; ! i M ) : 

soybeon ZZ?< ptn VLPQRTQN I REVYEYGRKLGQGOFGTTFECTRRASCCKFACJCS I PKRKLLCKEDYEDVWRE I Q i MHHLSE a" 

001 ^CP< Ptn QPNVVGLRGAYEDICQSVHLVHELCAGGELFORI iARGQYTERGAAELLRAIVQiVHTCHSMCV-HROIICP 220 

••111- •fill- : 11 I 1 i M I: ! I i 1 I M I :.: I : t : i I : I 1 : i : : I I. I : I I i MINI U 
soybean .C:PK ptn HANVVR I EGTYEOSTAVHLYMELCECGELFDR I VQKGHYSERQAARL flCT I VEVVEACHSLCV«HROL<P '61 

DOi ::?K otn ENFLLLSKDEOAPL<ATDFGLSYFFKECELLROIVGSAYYIAPEVLICRKYGPEAOIWSVGYML^!FLAGY 290 

I II I • : M I 1: I I M i I i I 1 M: i. I I : ! M I: M I f I I I: : . I I 1 I : ! i I: i I : It I: I: I t 
soybeon ptn ENFLFOT I OEDA»CL<ATDFGLSVFY)CPGESFCOVYGSPYYYAPEYLRICLYGPESDVWSAGY I LY ILLSGV 23^ 

Doi CDP< otn PPFWAENENGIFTAILRGOLDLSSEPWPHISPGAKDLVIOCMLNINPKERLTAFQYLNHPWUEDGDAPOT 350 

II I M I: I: 1 M : 1 I 1: I i : i t II I : I I : : I I i t : ; I M : I II: M M: . M MM: ':.!(!: 
soyoeon ::?< p^n PPFVAESEPG I FRC I LLGKLOFHSEPWPS I SDSAKOL i R1CMLDQNP<TRLTAHEYLRHPV1 VO:n [ APOK 3:i 

00 otn PLDNVVL0RL»CQFRAMNQF<>;AALRI IAGCLSEEE!TGL)CEMFICNIDKONSGTiTLDELKHG;.A<HG?<L i.30 

t t I: : M Mill M I: : I !: M 1 11: : M t II I: It M t i II Mi 1 I I 1 M M: I I: : I: : I 
scybeo- :: = < ot" PLDSAVL SRL<C=" S A'*S'<'- <<"AL=?Y ! AE^*. SEETI CGL <E'.='<M ! O'^^SSG' ! ^E*. <C:, <-VGSEL V 

00 1 otn SOSEMEKLMEAADADGNGL 1 OYOEFVTATVHMNlCLDREEHLYT AFQYFDKDNSGY ! TKEELEHALXEQCL 500 

: : I I : : M ! 1 I : i : : ! . t M : M : : M M i M : I i ! I : 11 . M M I : I I 1 I I , I . : t 1:11 
soyoeon Z2^< ptn MesE I KDLMOAAO I OKSGT i DYGEF I AATYHUNKtEREENLVSAFSYFOKDGSGY I TLDE 1 OCACJCDFGL iOI 

DO- Dtn YCAOtCI<DIlSOACSDNOGRlDYSEFYAHMRKGTAGAEPMNl<< 54^* 

1; . I: ': I: : : !: M ; i: M f: M: M M I 1: : 1: : : : : : I 

3c,oec- ::=< ct" -:::-:::-:<:::::N:G:::^Gr"AA— ^<:sGc;Gn^*-R< ^isa 
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FIG. 35 

pol CDPK gen Map (1 > 4165) She and Sequence 

Enzymes : 6 1 98 enzymes (Filtered) 

Settings . Cireuiar. Canain Sites Ontv, Standard Genetic Cod 

:tagtaacac:tctc:aatcgcttgggttggcacattcttagcttttatcacattttaagaaatagagttcaccaccttc 
, : ; 1 > . 80 

AAAATATGCCTATACAATGAATGATGCTTGGATGCAATATAGCTAGATTCAACTAGCTATATATGGTCAATAGAACCCTG 
; ■> ^ 160 

tgagcac:tcacaaacacgacttcaattttgagaccctaagc3AGTaaatggttaaagtc:tcttattattagtcttagg 

^ 1 ^ > 1 ^ 2*10 

ACTTCTCCTTGCTAAATGCTTGTCAGCGATCTATATATCTTCCCCACTGCGGGA6ATACTATATATAGGGCCTTGGACCT 
; ; ^ ; ^ : u 

CTAGGGTATCTCAAAGGCCTAGTCACAACAATTCTCAACAGTATTTAATTTTATACATGTATGAACAGTGTAGGAATTTG 

• ; ; ; , , _ . U 

AGTGCCCAACCCAAGAGTGGGAGGTGTAAATTGGGTAGCTAAACTTAAATAGGGCTCTTCTTATTTAGGTTTATCTAGTC 
^ ; i ^ 480 



r T 



; ACTTAGACTAA7TCAGAAAGAATTTTACAACCTATGSTTAATCATATCTCTAGTCTAAGCAAATTTAGGAAAGTTAA 



560 



AAGCACACAATTAGGCACATGTGAAAGATGTGTATGGTAAGTAAAAGACTTATAAGGAAAAAGTGGGTGAATCCTCAAGA 
: ^ , i ^ ; ; ; L 540 

TGTGGTGGTATATCC:AATGATATTAGAT5CCAGAATATAGGGGGGAAATCGATGTATACCATCTCTACCAGGATACCTG 
> ^ , , 1 ^ 1- 720 

tgcggactgtgcaactgacacatggaccatggtgtcttcttagatt-ggttattagctaattgcgctacaacttgttcaa 

i , i , ; , u 800 



ggctagaccaaattaaaaaactaatattaaacataaaaagttaggcaaactatagtaaattatgcagcgatccaacaaca 

• ; , i i ; : ^ 830 

AGCCATGTCTC3TGGGTCATGAGCCACGCGTCGGCCATACACCCACATGATGTTTCCATACGGATGGTC:TTATGCAATT 

; ^ ; , ^ 

TTGTCTGCAAAACACAAGCCTTAATACAGCCACGCGACAATCATGGAAGTGGTCGTTTTAGGTCCTCATCATGAAGTTCA 

— . ^ . : ; ; ^ l ^q^O 

gggaaaacgcatcaaatgtaatgcagagaaatggtatttcttctcttgtaaatcagggagaggagtaccatcagtacaga 

— ' ' ' ^ ^ 1120 

EcoR I 

ttcagaatcagaattcagtcttccaacgacaataatcgcagcatcttgtaaaaatttgcagaaacttctgtttgacttgt 

i ' i 1200 

agc:ctgac:tttgcaaatatttgaagttgtgcctgctgacacaacttcaatctggaagt6ctgttgatcagttttgcca 

; . ^ 1280 

<^aaacagcaagcagc:tatatatatctgtcacgagacac::tgc:gccct:ttctttc:cgccattcc:tc::taccctt 

' i : ; L 1360 
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FIG. 35 ICONTl 



Dol CDPKoene Mao n >iii65) 



Sit and Sequence 



CAAAATCTAGAAACCTTTTTTTTTCCTCCCSATACGCCCCTCCATCTCTCGCCSTTCATGTCCGTGGCTGGCTGCCCTCC 



1440 



^mflNAstart-' 



gtgggagcaggcggccgcactcgttccccgccgcagccatgggcca6Tgctgctc:aagggcgccggagaggc::cgcca 



1520 



SXON 1 



cggaggcgccaaacggcaggcgccaagccgcgggcgtccgcgaacaacgccgacggacaacgsgcgtcgtcctcgtccgc 



1600 



EXON 1 



igtggctgctgccgctgctgctgccggtggtggtggcggcggcacgacgaagccggcctcacccaccggcggcgccaggg 



1680 



EXON 1 



CCAGCTCCGGCAGCAAACCGGCGGCGGCCGTGGGCACGGTGCTGGGCCGGCCCATGGAGGACGTGCGCGCGACCTACTCG 



1760 



EX0N1 



Ava i 

ATGGGCAAGGAGCTCGGGCGCGGGCAGTTCGGCGTGACGCACCTGTGCACGCACCGGACGAGCGGCGAGAAGCTGGCGTG 



1840 



EXON 1 



CAAGACGATCGCGAAGCGGAAGCTGGCGGCCAGGGAGGACGTGGACGAC3TGCGGCGGGAGGTGCAGATCATGCACCACC 



1920 



EXON1 



tctccggc:ag:::aacgtggtgggcctccgcggcgcgtacgaggacaagcagagc3Tgcacctcgtcatggagctgtgc 



2000 



EXON 1 



Ava I 

gcgggcggggagctcttcgaccgcatcatcgcccggggccagtacacggagcgcggcgccgcggagctgctgcgcgccat 



2080 



EXON 1 
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FIG. 35 (CONT) 

pol CDPK gen Map (^ > 4165) Site and Sequence 



Ava I 

CGTGCAGATCGTGCACACCTGCCACTCCATGSGGGTSATGCACCGSGACATCAAGCCCGAGAACTTCCTGCTGCTCAGCA 

^ ; ; ^ L 2160 



cXON 1 



AGGACSAGGACGCGCCGCTCAAGGCCACCGACTTCGGCCTCTCCGTCTTCTTCAAGGAGGSCGAGCTGCTCAGGGACATC 
1 ; • ^ ' 22^0 



£XON 1 



Ava I 



GTCGGCAGCGCCTACTACATCGCGCCCGAGSTGCTCAAGAGGAA6TACGGCCCGGAGGCCGACATCTGGAGCGTCGGCGT 

■ ' . i ; ■ ■ i ^ L 2320 



£XON 1 



3am H I 

CATGCTCTACATCTTCCTCGCCGGCSTGCCTCCCTTCTGGGCAGGTCGGATCCGTCCGTGTTCGTCCTAGACGATATACA 



2^00 



EXON 1 ' INTRON 1 




GAACCCGACGATGGATTTGCTTCTCAGCCCTGTTCTTGCATCACCAGAGAACGAGAACGGCATCTTCACCGCCATCCTGC 
^ i ; i ■ ' ^ 2480 



iNTRON 1 



EXON 2 



GAGGGCAGCTTGACCTCTCCA6CGA6CCATGGCCACACATCTCGCCGGGA6CCAAGGA7CTCGTCAAGAAGATGCTCAAC 
^ ^ i 1 ' ■ ■ ' ■ ■ i ^ 2560 



£X0N2 



ATCAACCCCAAGGAGCGGCTCACGGCGTTCCAGGTCCTCAGTAAGTACCCAGATCGTTGCTGTCATACACTCATATGAAT 

; ' i « . 1 ^ u 2640 




EXON 2 ' INTRON 2 



TGTATCGTTCATGA6CAACGATCGA6CGGATTTGGTGAACTTGTA6ATCACCCATGGATCAAAGAAGACGGAGAC5CGCC 
■ ■ ^ ^ ^ 1 ^ ; u 2720 



INTRON 2 



3f- 



EXON 3' 



TGACAC3CCGCTTGACAACGTTGTTCTCGACAGGCTCAAGCAGTTCAGGGCCATGAACCAGTTCAAGAAAGCAGCATTGA 

■ ■ ' ' ' " ' ' ■ . ■ ■ ^ 2800 

EXON 3 — 
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FIG. 35 (CONT) 

poiCPPKoen Map M >4165) SHe and Sequence 

GG5TAC.ATTATCTGATAAAAGCTCCACAAATACAACTTCT5AAGAACAGCAATGCTTACAC3GCAGAATTTTCATTATAA 

; 1 : . : ^ 2B80 




INTH0N3- 



ATGCTCTTGATGACATAATGTTAGATCATAGCTGGGTGCCTATCCGAAGAGGAGATCACAGGGCTGAAGGAGATGTTCAA 
i 1 : 1 u 2960 




iNTRON 3 1 EXON 4 



GAACATTGACAAGGATAACAGCGG6ACCATTACCCTC3ACGAGCTCAAACACGGGTTG6CAAA6CACGGGCCCAAGCTGT 
^ ; , 1 : 2Q^Q 



EXON 4 



CAGACAGCGAAATGGAGAAACTAATGGAAGCA6TGAGTTTTCAGAGTACAATCTTAAAAAAAGGAATTGTGATTCTTTTC 
> : ^ ' ■ ' 3120 



EXON 4 ! INTRON 4 



AAAATGAAGAAGTAATCTGAAAACATCCCTGCTGAAATGCTTTATACATTTCCAGGCTGACGCTGACGGCAACGGGTTAA 

■ — ' ' ' ' ' — ' — ' ' ' ' ' ^ 3200 



INTRON 4 1 EXON 5 



^EcoRI 

GACTAC3ACGAATTCG7CACCGCAACAGTGCATATGAACAAACTGGATAGAGAAGAGCACCTTTACACAGCATTCCAG 



3280 



EXON 5 



^EcoRI 

ATTTCGACAAGGACAACAGCGGGTAAGTTGAACGTTAAAATGATACAGCTGGTACCTGAATTCTGGACAACACATATCA 



3360 



EXON 5 ^ INTRON 5- 



lAACAGGACACATATATAATTCGTTTATCTCACAGGTACATTACTAAAGAAGAGCTTGAGCACGCCTTGAAGGAGCAAGG 

^ ^ — ^ : ^ 3^i|Q 



INTRON 5 ^ EXON 6- 



3TTGTATG AC3CCGATAAAATCAAAGACATCATCTCCGATGCCGACTCTGACAATGTAAGGAACAAACATTATTTAAATT 
' ' ' ^ ^ ^ 3S20 




EXON 6 1 INTRON 6 
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FIG. 35 (CONT) 

polCPPKowia Mao M >4165) Srte and Sequence 

TCAGCCCACAAACTAAACTATAGAAACCACATCATGATATCAAATTTTGAGGTGGCGGTGCTACAGAAATAGAACCCAGT 

■ ' ' ' ^ ^ 36CW 



»r4TR0N 6- 



ACACCAAAATGACTAACTTGTCATGATTAGTTGTTCCTCGTAACTGAACATTTGTGTTCTTAGTTTCTTATTGTTAAACC 
• . i ^ 3680 



INTRON 6' 



AAA6ACTTAAATTCACTTTTGCACATGCAGGATGGAAGGATAGATTATTCAGAGTTTGTGGCGATGATGAGCAAAGGGAC 
^ i ' ' ' ^ 3760 




INTRON 6 ^ EXON 7 



GGCTGGTGCCGAGCCAATGAACATCAA6AAGAGGCGA6ACATAGTCCTATAGT6AAGTGAAGCAGWAAGTGT6TAATGTA 

I i ■ I ' . , I ■ [ I ; I I I 3gi>0 



EXON 7' 



ATGTGTATAGCAGCTCAAACAAGCAAATTTGTACATCTGTACACAAATGCAATGGGGTTACTTTTGCAACTTAGTTCATG 
■ ' ' ■ ■ ■ i 3920 

GATGGTTGTGTACGTTGTGCTATTGATTGCAAGTGATTTGAAAGACATGCATACTTAGGAACTGAGAAAGATA6ATCTAC 

TACTGCTAGAGACAGAACAATAGGATKKYAATTCAGYAAGTGYGTATTTCAGAAGACTACAGCTGGCATCTATTATTCTC 

' ■ ■ ' ' ■ ' ' ' ' ' .I t I , r 4080 

ATTGTCCTCGCAAAAATACTGATGATGCATTTGAGAGAACAATATGCAACAAGATCGAGCTCCCTATAGTGAGTCGTATT 

; , , ; , ; ^ ^ ^^gQ 

aGGCC 

4165 
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\ar2-PAT.SEQ 9/21/92 

NAME: pCrB55l5 "^415 3PS DNA CIRCULAR UPDATED 9/21/92 

DESCRIPT'^ON: pGFl with synchecic geiser fix. 
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1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCrACA ACTGCCTGAG CAACCCCGAG 
MerAapAsn AsnProAan ZleAsnGluCya IlePrcTyr AsnCysLeu SerAanProGlu 

ValGiuVai Z.euGlyGly GluArglieGlu rhrGiyTyr ThrProIle AspXleSerLeu 

121 AGCCTGACCC ACTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuXhr GlnPheLeu LeuSerGIuPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TC7GGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAapIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlie 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu XleAsnGln ArgXleGluGXu PheAXztArg AanGlnAXa IleSerAzgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCGAGTG GGAGGCCGAC 
GluGlyLeu SerAanLeu TyrGlnlleTyr AlaGluSer PheArgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCCTGCG C3AGGAGATG CGCATCCAGT 7CAACGACAT GAACAGCGCC 
ProThrAan ProAiaLeu ArgGluGluMer ArglleGln PheAsnAsp MetAanSexAia 

421 CTGACCACCG CCATCCCCCT G7TCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
LeuThrThc AiaXlePro LeuPheAlaVai GlaAanTyr GlnVaiPro LetiLeuSerVal 

481 TACGTGCAGG CCGCCAACCT GCACCTGAGC GTGCTGCGCG ACGTCAGCG7 GTTCGGCCAG 
TyrValGln AiaAlaAan LetiHial^euSer VaiLeuArg AapValSer ValPheGlyGia 

541 CGCTGGGGCr TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
ArgTrpGly PheAspAla AiaXhrXleAan SerArgTyr AanAspLeu ThrArgLeuXle 

601 GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCrCGAGCG CGTGTGGGGT 
GlyAanTyr ThrAspHis AlaValArgTrp TyrAanThr GlyLeuGlu ArgValTrpGly 

661 CCCGACAGCC GC5ACTGGAT CAGGTACAAC CAGTrCCGCC GCGAGCTGAC CCTGACCGTG 
ProAspSer ArgAapTrp XleArgTyrAan GlnPheArg ArgGluLeu ThrLeuThrVal 

721 CTGGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
LetiAapIle ValSerLeu PheProAsnTyr AapSerArg ThrTyrPro XleArgThrVai 

7 81 AGCCAGCTGA CCCGCGAGAT TTACACCAAC CCCGTGCrGG AGAACTTCGA CGGCAGCTTC 
SerGlnLeu ThrArgGiu XieTyrThrAan ProVaiLeu GluAsnPhe AspGlySerPhe 

841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 
ArgGlySer AiaGlnGly IleGluGlySer IleArgSer ProHiaLeu MetAapIleLeu 

901 AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAG7 ACTACTGGAG CGGGCACCAG 
AanSerXle ThrXleTyr ThrAapAlaHia ArgGlyGlu TyrTyrTrp SerGlyHiaGln 

961 ATCArGGCCA GCCCCGTCGG CTTCAGCGGC CCCGAGTTCA CCrrCCCCCT GTACGGCACC 
IleMetAla SerProVai GlyPheSerGly ProGluPhe ThrPhePro LeuTyrGlyThr 

1021 ATGGGCAACG CTGCACCTCA GCAGCGCATC GTGGCACAGC TGGGCCAGGG ACTGTACCGC 
MerGlyAan AlaAlaPro GinGlnArgXie VaiAlaGln LeuGlyGln GlyVaiTyrArg 

lOai ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAanXle GlylieAan AanGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GrTCGCCTAC GGCACCAGCA GCAACCTGCC CAGCGCCGTG 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAanLeu ProSerAiaVai 

1201 TACCGCAAGA GCGGCACCGT GGACAGCrTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLya SerGlyThr ValAapSerl-eu AapGluIle ProProGln AanAanAanVal 
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12 61 CCACCTC3AC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATGTTCCG CAGTGGCTTC 
ProProArg GinGiyPhe SerHisArgLeu SerHiaVai SerMetPhe ArgSerGlyPhe 

1221 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTrCA GCTGGATTCA CCGCAGTGCC 
SerAsnSer SerVaiSer IlelleArgAla ProMetPhe SerTrpIle HiaArgSerAla 

1381 OAGTTCAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCCTGAC CAAGAGCACC 
GluPheAsn Aanllelle ProSerSerGln IleThrGln IleProLeu ThrLysScrThr 

14 41 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
AanLeuGly SerGlyThr SerValValLya GlyProGly PheThrGly GlyAapXleLeu 

1501 CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACAXCACCGC CCCCCTGAGC 
ArgArgThr SerProGly GinlleSerThr LeuArgVal AanlleThr AiaProLeuSer 

L561 ^GCGCTACC GCGTCCGCAT CCGCTACGCC AGCACCACCA ACCTGCAGTT CCACACCAGC 
GlnArgTyr ArgValArg IleArgTyrAia SerThrThr AsnLeuGln PheKiaThrSer 

1621 ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
XlftAspGly AxgProIle AsnGinGlyAan PheSerAla ThrMetSer SerGlySerAro 

1681 CTGCAGAGCG GCAGCTTCCG CACCGTGGGC TTCACCACCC CCTTCAACTT CAGCAACSGC 
LeuGlnSer GlySerPhe ArgThrValGly PheThrThr ProPheAan PheSerAanGly 

1741 AGCAGCGTGT TCACCCTGAG CGCCCACGTG TTCAACAGCG -GCAACGAGGT GTACATCGAC 
SerSerVai PheThrLeu SerAlaHiaVai PheAanSer GlyAanGlu VaiTyrlleAap 

1801 CGCATCGAGT TCGTGCCCGC CGAGGTGACC TTCGAGGCCG AGTACGACCT GGAGAGGGCT 
ArglleGlu PheVaiPro AlaGluValThr PheGlxiAXa GluTyrAsp LeuGiiiArgAia 

1861 CAGAAGGCCG TGAACGAGC? GTTCACCAGC AGCAACCAGA TCGGCCTGAA GACCGACGTG 
GlnLyaAia ValAanGlu LeuPheThrSer SerAanGin IleGlyLeu LyaThrAspVal 

1921 ACCGACTACC ACATCGATCA AGTATCCAAT TTAGTTGAGT GTTTATCTGA TGAATTTTGT 
ThrAspTyr HialleAap GlnValSerAsn LeuValGlu CyaLeuScr AapGluPheCya 

1981 CTGGATGAAA AAAAAGAATT GTCCGAGAAA GTCAAACATG CGAAGCGACT TAGTGATGAG 
LeuAspGiu LyaLyaGiu LeuSerGluLys ValLysHis AlaLyaArg LeuSerAspGlu 

2041 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA GACAACTAGA CCGTGGCTGG 
ArgAsnLeu LeuGinAap ProAanPheArg GiylleAan ArgGlnLeu AapArgGiyTrp 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGTT 
ArgGlySer ThrAapIle ThrlleGlnGly GlyAapAap VaiPheLya GluAanTyrVai 

2161 ACGCTATTGG GTACCTTTGA TGAGTGCTAT CCAACGTATT TATATCAAAA AATAGATGAG 
ThrLeuLeu GiyThrPhe AapGluCysTyr ProThrTyr LeuTyrGln LyalieAapGlu 

2221 rCGAAATTAA AAGCCTATAC CCGTTACCAA TTAAGAGGGT ATATCGAAGA TAGTCAAGAC 
SerLyaLeu LysAlaTyr ThrArgTyrGln LeiiArgGly TyrlleGlu AapSerGlnAap 

2281 TTAGAAATCT ATTTAATTCG CTACAATGCC AAACACGAAA CAGTAAATGT GCCAGGTACG 
LeuGiuIle TyrLeuIle ArgTyrAanAla LyaHiaGlu ThrVaiAan ValProGlyThr 

2341 GGTTCCTTAT GGCCGCrTTC AGCCCCAAGT CCAATC-GGAA AATGTGGGGA GCCGAATCGA 
GlySerLeu TrpProLeu SerAlaProSer ProIleGly LyaCyaGly GluProAanArg 

2401 TGC3CTCCGC ACCTGGAGTG GAACCCGGAC CTAGACTGCA GCTGCAGGGA CGGGGAGAAG 
CyaAlaPro HiaLeuGlu TrpAanProAap LetiAapCya SerCysArg AapGlyGiuLya 

24 61 TGCGCCCATC ATTCCCATCA rrrCTCCTTG GACATTGATG TT3GATGTAC AGACTTAAAT 
CyaAlaHia HiaSerHia HiaPheSerLeu AapIleAap VaiGlyCys ThrAspLeiiAan 

2521 GAGGACTTAG GTGTATGGGT GATATTCAAG ATTAAGACGC AAGATGGCCA TGCAAGACTA 
GiuAspLeu GlyValTrp VaillePheLya lieLyaThr GlnAapGly HiaAlaAxgLeu 

2581 GGAAATCTAG AATTTCTCGA AGAGAAACCA TTAGTAGGAG AAGCACTAGC TCGTGTGAAA 
GlyAanLeu GluPheLeu GluGluLyaPro :*euValGiy GiuAlaLeu AlaArgValLya 
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2641 AGAGCGGAGA AAAAATGGAG AGACAAACCT 
ArgAlaGlu LysLysTrp ArgAapLyaArg 

2701 TATAAAGAGG CAAAAGAATC TGTAGATGCr 
TyrLyaGlu AlaLysGlu SerValAspAla 

2761 CAAGCGGATA CCAACATCGC GATGATTCAT 
GlnAlaAsp ThrAsnlle AlaMecIleHis 

2821 GAAGCTTATC TGCCTGAGCT GTCrSTGATT 
GluAlaXyr LeuProGlu LeuSerVaiZle 

2881 TTAGAAGGGC GTATTTTCAC TGCATTCTCC 
liCuGluGly ArgllePhe TtirAlaPheSer 

2941 GGTGATTrrA ATAATGGCTT ATCCTGCTGG 
GlyAspPhe AsnAsnGly LeuSerCyaTrp 

3001 CAAAACAACC ACCGTTCGGT CCTTGTTGTT 
GlnAanAsn HisArgSer ValLeuVaXVal 

3061 GTTC3TGTCT GTCCGGGTCG TGGCTATATC 
ValArgVal CyaProGly ArgGlyTyrlle 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC 
GlyGluGly CysrVaiThr IleHisGluIle 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC 
AanCysVal GluGluGlu ValTyrProAan 

3241 ACTCAAGAAG AATATGAGGG TACGTACACT 
ThrGinGlu GluTyrGlu GlyThrTyrTbr 

3301 GAAAGCAATT CrTCTGTACC AGCTGATTAT 
GluSerAan SerSerVal ProAiaAapTyr 

3361 GATGGACGAA -GAGACAATCC TTGTGAATCT 
AapGlyArg ArgAapAan ProCyaGluSer 

3421 CCAGCTGGC? ATGTGACAAA AGAATTAGAG 
ProAlaGly TyrValThr LyaGluLeuGlu 

3481 GAGATCGGAG AAACGGAAGG AACATTCATC 
GluIXeGiy GluThrGlu GlyThrPheXXe 

3541 GAATAA 
Glu 



GAAAAATTGG AATGGGAAAC AAATATTCTT 
GluLysLeu GluTrpGlu ThrAanlleVal 

TTATTTGTAA ACTCTCAATA TGATAGATTA 
LeuPheVal AsnSerGXn TyrAapArgLeu 

GCGGCAGATA AACGCGTTCA TAGCATTCGA 
AlaAlaAap LyaArgVal HisSerZltaArg 

CCGGGTGTCA ATGCGGCTAT TTTTGAAGAA 
ProGlyVal AanAlaAla IlePheGluGXu 

CTATATGATG CGAGAAATGT CATTAAAAAT 
LeuTyrAsp AlaArgAsn VaXIleLyaAan 

AACGTGAAAG GGCATGTAGA TGTAGAAGAA 
AsnValX.ys GlyHisVaX AspVaXGXuGXu 

CCGSAA7GGG AAGCAGAAGT GTCACAAGAA 
ProGluTrp GXiiAXaGXu VaXSerGXnGlu 

C77CGTG7CA CAGCGTACAA GGAGGGATAT 
LeuArgVaX ThrAXaTyr LysGXuGXyTyr 

GAGAACAATA CAGACGAAC? GAAGTTTAGC 
GXuAanAsn ThrAspGXu LeuLyaPheSer 

AACACGGTAA CGTGTAATGA TTATACTGCG 
AanThrVaX ThrCysAan AapTycThrAXa 

TCTCGTAATC GAGGATATGA CGGAGCCTAT 
SerArgAsn ArgGXyTyr AapGXyAXaTyr 

GCATCAGCCT ATGAAGAAAA AGCATATACA 
AXaSerAXa TyrGXuGXu LysAXaTyrThr 

AACAGAGGAT ATGGGGATTA CACACCACTA 
AanArgGly TyrGlyAap TyrThrProLeu 

TACTTCCCAG AAACCGATAA GGTATGGATT 
TyrPhePro GXuThrAsp LyaVaXTrpIXe 

GTGGACAGCG TGGAATTACT TCTTATGGAG 
VaXAapSer VaXGXuX^eu LeuX«euMetGXu 
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